Isoform 1: 

1 TTGCTCACTG CTCACCCACC TGCTGCTGCC ATGAGGCACC TTGGGGCCTT 
51 CCTCTTCCTT CTGGGGGTCC TGGGGGCCCT CACTGAGATG TGTGAAATAC 
101 CAGAGATGGA CAGCCATCTG GTAGAGAAGT TGGGCCAGCA CCTCTTACCT 
151 TGGATGGACC GGCTTTCCCT GGAGCACTTG AACCCCAGCA TCTATGTGGG 
201 CCTACGCCTC TCCAGTCTGC AGGCTGGGAC CAAGGAAGAC CTCTACCTGC 
251 ACAGCCTCAA GCTTGGTTAC CAGCAGTGCC TCCTAGGGTC TGCCTTCAGC 
301 GAG GAT G AC G GTGACTGCCA GGGCAAGCCT TCCATGGGCC AGCTGGCCCT 
351 CTACCTGCTC GCTCTCAGAG CCAACTGTGA GTTTGTCAGG GGCCACAAGG 
401 GGGACAGGCT GGTCTCACAG CTCAAATGGT TCCTGGAGGA TGAGAAGAGA 
4 51 GCCATTGACA CAGCAGCCAT GGCAGGCTTG GCATTCACCT GTCTGAAGCG 
501 CTCAAACTTC AACCCTGGTC GGAGACAACG GATCACCATG GCCATCAGAA 
551 CAGTGCGAGA GGAGATCTTG AAGGCCCAGA CCCCCGAGGG CCACTTTGGG 
601 AATGTCTACA GCACCCCATT GGCATTACAG TTCCTCATGA CTTCCCCCAT 
651 GCGTGGGGCA GAACTGGGAA CAGCATGTCT CAAGGCGAGG GTTGCTTTGC 
701 TGGCCAGTCT GCAGGATGGA GCCTTCCAGA ATGCTCTCAT GATTTCCCAG 
751 CTGCTGCCCG TTCTGAACCA CAAGACCTAC ATTGATCTGA TCTTCCCAGA 
801 CTGTCTGGCA CCACGAGTCA TGTTGGAACC AGCTGCTGAG ACCATTCCTC 
851 AGACCCAAGA GATCATCAGT GTCACGCTGC AGGTGCTTAG TCTCTTGCCG 
901 CCGTACAGAC AGTCCATCTC TGTTCTGGCC GGGTCCACCG TGGAAGATGT 
951 CCTGAAGAAG GCCCATGAGT TAGGAGGATT CACATATGAA ACACAGGCCT 
1001 CCTTGTCAGG CCCCTACTTA ACCTCCGTGA TGGGGAAAGC GGCCGGAGAA 
1051 AGGGAGTTCT GGCAGCTTCT CCGAGACCCC AACACCCCAC TGTTGCAAGG 
1101 TATTGCTGAC TACAGACCCA AGGATGGAGA AACCATTGAG CTGAGGCTGG 
1151 TTAGCTGGTA GCCCCTGAGC TCCCTCATCC CAGCAGCCTC GCACACTCCC 
1201 TAGGCTTCTA CCCTCCCTCC TGATGTCCCT GGAACAGGAA CTCGCCTGAC 
1251 CCTGCTGCCA CCTCCTGTGC ACTTTGAGCA ATGCCCCCTG GGATCACCCC 
1301 AGCCACAAGC CCTTCGAGGG CCCTATACCA TGGCCCACCT TGGAGCAGAG 
1351 AGCCAAGCAT CTTCCCTGGG AAGTCTTTCT GGCCAAGTCT GGCCAGCCTG 

14 01 GCCCTGCAGG TCTCCCATGA AGGCCACCCC ATGGTCTGAT GGGCATGAAG 
1451 CATCTCAGAC TCCTTGGCAA AAAACGGAGT CCGCAGGCCG CAGGTGTTGT 

15 01 GAAGACCACT CGTTCTGTGG TTGGGGTCCT GCAAGAAGGC CTCCTCAGCC 
1551 CGGGGGCTAT GGCCCTGACC CCAGCTCTCC ACTCTGCTGT TAGAGTGGCA 
1601 GCTCCGAGCT GGTTGTGGCA CAGTAGCTGG GGAGACCTCA GCAGGGCTGC 
1651 TCAGTGCCTG CCTCTGACAA AATTAAAGCA TTGATGGCCT GTGAAAAAAA 
1701 AAAAAAAAAA AAAAAAAAAA AA 

(SEQ ID NO:l) 

FEATURES: 

5'UTR: 1-30 

Start Codon: 31 

Stop Codon: 1159 

3'UTR: 1162 

Homologous proteins: 

Top 10 BLAST Hits 

CRAI108000024653390 /altid=gi 1 12742775 /def =ref 1 XP_009922 . 2 | tr.. 
CRAI108000024636236 /altid=gi i 298316 /def =gb ! AAB2552 6 . 1 | transc. . 
CRAj 18000004926133 /altid=gi j 339205 /def =gb I AAA61057 . 1 | (L02648.. 
CRA| 108000024042036 /altid-gi [ 12 654 67 5 / de f =gb i AAH0 117 6.11 AAH0 1 . . 
CRAi 18000004926130 /altid=gi I 4507 409 /def=ref [ NP_000346 . 1 1 tran. . 
CRA{18000004926132 /altid=gi I 339203 /def =gb | AAA61056 . 1 ! (L02647.. 
CRAI18000005170902 /altid-gi I 7 657 63 9 /def=ref [NP_056564 . 1 1 tran.. 
CRAI18000005218941 /altid=gi i 45724 54 /def-gb I AAD23829 . 1 | AF12128 . . 
CRA| 164000136745249 /altid-gi | 11968124 /def -ref I NP_07 197 9 . 1 1 tr.. 
CRA| 18000004926134 /altid=gi I 4507407 / def=ref |NP_001053 . 1 1 tran.. 



Score E 

752 0.0 

732 0.0 

732 0.0 

731 0.0 

727 0.0 

725 0.0 

515 e-145 

501 e-140 

481 e-134 

108 2e-22 
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EST: 

gi 1 10725490 /dataset-dbest /taxon=96.. 
gi 1 10947399 /dataset=dbest /taxon-96.. 
gi I 9121897 /dataset=dbest /taxon=9606. 
gi 1 13280819 /dataset=dbest /taxon-96. . 
gi 113287907 /dataset=dbest /taxon=96.. 
gi 113286505 /dataset-dbest /taxon=96.. 
gi I 8150776 /dataset=dbest /taxon=960 . . 
gi I 5936410 /dataset-dbest /taxon=9606 
gi I 6888875 /dataset^dbest /taxon=9606. 
gi J 6888872 /dataset=dbest /taxon=9606 . 



858 0.0 

846 0.0 

846 0.0 

846 0.0 

833 0.0 

831 0.0 

815 0.0 

726 0.0 

726 0.0 

726 0.0 



EXPRESSION INFORMATION FOR MODULATORY USE: 

library source: 

gi | 10725490 | adult adrenal gland 

gi | 10947399 I mammary gland 

gi | 9121897 | retinoblastoma 

gi | 13280819] adenocarcinoma cell line 

gi 1 13287907 | retinoblastoma 

gi | 13286505 | embryonal carcinoma, cell line 
gi | 8150776 | adult uterus 
gi | 5936410 | adult uterus 
gi I 6888875 | adult headjieck 
gi | 6888872 | adult headjneck 

Tissue Expression: 
Human leukocyte 
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Isoform 2: 

1 GGAGGATTAA TCAGTGACAG GAAGCTGCGT CTCTCGGAGC GGTGACCAGC 
51 TGTGGTCAGG AGAGCCTCAG CAGGGCCAGC CCCAGGAGTC TTTCCCGATT 
101 CTTGCTCACT GCTCACCCAC CTGCTGCTGC CAT GAG GC AC CTTGGGGCCT 
151 TCCTCTTCCT TCTGGGGGTC CTGGGGGCCC TCACTGAGAT GTGTGAAATA 
201 CCAGAGATGG ACAGCCATCT GG TAG AG AAG TTGGGCCAGC ACCTCTTACC 
251 TTGGATGGAC CGGCTTTCCC TGGAGCACTT GAACCCCAGC ATCTATGTGG 
301 GCCTACGCCT CTCCAGTCTG CAGGCTGGGA CCAAGGAAGA CCTCTACCTG 
351 CACAGCCTCA TGCTTGGTTA CCAGCAGTGC CTCCTAGGGT CTGCCTTCAG 
4 01 C GAG GAT G AC GGTGACTGCC AGGGCAAGCC TTCCATGGGC CAGCTGGCCC 
4 51 TCTACCTGCT CGCTCTCAGA GCCAACTGGC ATGATCACAA GGGCCACCCC 
501 CACACTAGCT ACTACCAGTA TGGCCTGGGC ATTCTGGCCC TGTGTCTCCA 
551 CCAGAAGCGG GTCCATGACA GCGTGGTGGA CAAACTTCTG TATGCTGTGG 
601 AACCTTTCCA CCAGGGCCAC CATTCTGTGG ACACAGCAGC CATGGCAGGC 
651 TTGGCATTCA CCTGTCTGAA GCGCTCAAAC TTCAACCCTG GTCGGAGACA 
701 ACGGATCACC ATGGCCATCA GAACAGTGCG AG AG G AG AT C TTGAAGGCCC 

7 51 AGACCCCCGA GGGCCACTTT GGGAATGTCT ACAGCACCCC ATTGGCATTA 
801 CAGTTCCTCA TGACTTCCCC CATGCGTGGG GCAGAACTGG GAACAGCATG 

8 51 TCTCAAGGCG AGGGTTGCTT TGCTGGCCAG TCTGCAGGAT GGAGCCTTCC 
901 AGAATGCTCT CATGATTTCC CAGCTGCTGC CCGTTCTGAA CCACAAGACC 
951 TACATTGATC TGATCTTCCC AGACTGTCTG GCACCACGAG TCATGTTGGA 

1001 ACCAGCTGCT GAGACCATTC CTCAGACCCA AGAGATCATC AGTGTCACGC 
1051 TGCAGGTGCT TAGTCTCTTG CCGCCGTACA GACAGTCCAT CTCTGTTCTG 
1101 GCCGGGTCCA CCGTGGAAGA TGTCCTGAAG AAGGCCCATG AGTTAGGAGG 
1151 ATTCACATAT GAAACACAGG CCTCCTTGTC AGGCCCCTAC TTAACCTCCG 
1201 TGATGGGGAA AGCGGCCGGA GAAAGGGAGT TCTGGCAGCT TCTCCGAGAC 
1251 CCCAACACCC CACTGTTGCA AGGTATTGCT GACTACAGAC CCAAGGATGG 
1301 AGAAACCATT GAGCTGAGGC TGGTTAGCTG GTAGCCCCTG AGCTCCCTCA 
1351 TCCCAGCAGC CTCGCACACT CCCTAGGCTT CTACCCTCCC TCCTGATGTC 
1401 CCTGGAACAG GAACTCGCCT GACCCTGCTG CCACCTCCTG TGCACTTTGA 
1451 GCAATGCCCC CTGGGATCAC CCCAGCCACA AGCCCTTCGA GGGCCCTATA 
1501 CCATGGCCCA CCTTGGAGCA GAGAGCCAAG CATCTTCCCT GGGAAGTCTT 
1551 TCTGGCCAAG TCTGGCCAGC CTGGCCCTGC AGGTCTCCCA TGAAGGCCAC 
1601 CCCATGGTCT GATGGGCATG AAGCATCTCA GACTCCTTGG CAAAAAACGG 
1651 AGTCCGCAGG CCGCAGGTGT TGTGAAGACC ACTCGTTCTG TGGTTGGGGT 
17 01 CCTGCAAGAA GGCCTCCTCA GCCCGGGGGC TATGGCCCTG ACCCCAGCTC 
1751 TCCACTCTGC TGTTAGAGTG GCAGCTCCGA GCTGGTTGTG GCACAGTAGC 
1801 TGGGGAGACC TCAGCAGGGC TGCTCAGTGC CTGCCTCTGA CAAAATTAAA 
1851 GCATTGATGG CCTGTGAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 
(SEQ ID NO:2) 

FEATURES : 

5'UTR: 1 - 131 

Start Codon: 132 

Stop Codon: 1332 

3 f UTR: 1335 

Homologous proteins: 

Top 10 BLAST Hits 

CRA| 108000024636236 /altid=gi i 298316 /def=gb I AAB25526. 1 1 transc. 
CRA| 108000024653390 /altid=gi 1 12742775 /def =ref I XP_009922 . 2 | tr.. 
CRAI18000004926133 /altid=gi | 339205 /def=gb I AAA61057 . 1 | (L02648.. 
CRAS 108000024042036 /altid=gi I 12654675 /def=gb I AAH0117 6 . 1 i AAH01 . . 
CRA| 18000004926130 /altid=gi I 4507409 /def=ref I NP_00034 6 . 1 1 tran. . 
CRA 1 18000004926132 /altid=gi I 339203 /def=gb I AAA61056 . 1 1 (L02647.. 
CRAi 18000005170902 /altid=gi [7 657 639 /def=ref |NP_056564 . 1 1 tran.. 
CRAU64000136745249 /altid-gi 1 11968124 /def=ref I NP_071979 . 1 ! tr.. 
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Score E 

793 0.0 

793 0.0 

7 92 0.0 

792 0.0 

788 0.0 

786 0.0 

561 e-159 

554 e-156 



CRAi 18000005218941 /altid=gi 1 4 572454 /def =gb I AAD2382 9 . 1 1 AF12128 . . . 545 e-154 
CRAI18000004926134 /altid=gi I 4507407 / def-ref I NP_001053 . 1 1 trail... 128 le-28 

EST: 

gii 10725490 /dataset-dbest /taxon-96... 
gi 15936410 /dataset-dbest /taxon=9606 ... 
gi I 6888875 /dataset=dbest /taxon=9606 . . . 
gi | 6888872 /dataset-dbest /taxon-9606. . . 
gi 1 12258937 /dataset-dbest /taxon-960 . . . 
gi 1 10947399 /dataset=dbest /taxon=96... 
gi I 13287907 /dataset-dbest /taxon=96... 
gi | 9121897 /dataset=dbest /taxon=9606 . . . 
gi | 13280819 /dataset=dbest /taxon=96... 
gi | 8150776 /dataset-dbest /taxon-960 . . . 

EXPRESSION INFORMATION FOR MODULATORY USE: 

library source: 

gi 1 10725490 | adult adrenal gland 
gi 1 5936410 | adult uterus 
gi | 6888875 i adult head_neck 
gi | 6888872 | adult headjneck 
gi | 12258937 | adult lung__tumor 
gi | 10947399 | mammary gland 
gi | 13287907 | retinoblastoma 
gi | 9121897 | retinoblastoma 
gi | 13280819 | adenocarcinoma cell line 
gi | 8150776 | 

Tissue Expression: 
Human hippocampus 



858 0.0 

835 0.0 

726 0.0 

726 0.0 

686 0.0 

680 0.0 

680 0.0 

680 0.0 

680 0.0 

656 0.0 
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Isoform 1: 

1 MRHLGAFLFL LGVLGALTEM CEIPEMDSHL VEKLGQHLLP WMDRLSLEHL 
51 NPSIYVGLRL SSLQAGTKED LYLHSLKLGY QQCLLGSAFS EDDGDCQGKP 
101 SMGQLALYLL ALRANCEFVR GHKGDRLVSQ LKWFLEDEKR AIDTAAMAGL 
151 AFTCLKRSNF NPGRRQRITM AIRTVREEIL KAQTPEGHFG NVYSTPLALQ 
201 FLMTSPMRGA ELGTACLKAR VALLASLQDG AFQNALMISQ LLPVLNHKTY 
251 IDLIFPDCLA PRVMLEPAAE TIPQTQEIIS VTLQVLSLLP PYRQSISVLA 
301 GSTVEDVLKK AHELGGFTYE TQASLSGPYL TSVMGKAAGE RE FWQLLRDP 
351 NTPLLQGIAD YRPKDGETIE LRLVSW 
(SEQ ID NO: 3) 

FEATURE S : 

Functional domains and key regions: 

PDOC00005 PS00005 PKC_PHOSPHO_SITE 
Protein kinase C phosphorylation site 
Number of matches : 2 

1 75-77 SLK 

2 174-176 TVR 

PDOC00006 PS00006 CK2_PH0SPH0_SITE 
Casein kinase II phosphorylation site 
Number of matches: 6 

1 67-70 TKED 

2 90-93 SEDD 

3 174-177 TVRE 

4 226-229 SLQD 

5 249-252 TYID 

6 302-305 STVE 

PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 
Number of matches: 7 

1 12-17 GVLGAL 

2 57-62 GLRLSS 

3 86-91 GSAFSE 

4 14 9-154 GLAFTC 

5 190-195 GNVYST 

6 209-214 GAELGT 

7 230-235 GAFQNA 

PDOC00009 PS00009 AMIDATION 
Amidation site 

162-165 PGRR 

SignalP results: 

Measure Position Value Cutoff Conclusion 

max. C 19 0.602 0.37 YES 

max. Y 19 0.702 0.34 YES 

max. S 5 0.974 0.88 YES 

mean S 1-18 0.949 0.48 YES 

Most likely cleavage site between pos . 18 and 19: ALT-EM 

BLAST Alignment to Top Hit: 

>CRA[108000024636236 /altid=gi | 298316 / de f =gb j AAB2 552 6. 1 1 

transcobalamin II, TC II [human, endothelial cells, 
Peptide, 427 aa] /org=human /taxon=9606 /dataset=nraa 
/length=427 
Length - 427 

Score - 732 bits (1870), Expect = 0.0 

Identities - 376/427 (88%), Positives = 376/427 (88%), Gaps = 51/427 (11%) 
Frame = +1 
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Query: 31 MRHLGAFLFLLGVLGALTEMCEIPEMDSHLVEKLGQHLLPWMDRLSLEHLNPSIYVGLRL 210 

MRHLGAFLFLLGVLGALTEMCEIPEMDSHLVEKLGQHLLPWMDRLSLEHLNPSIYVGLRL 
Sbjct: 1 MRHLGAFLFLLGVLGALTEMCEIPEMDSHLVEKLGQHLLPWMDRLSLEHLNPSIYVGLRL 60 

Query: 211 SSLQAGTKEDLYLHSLKLGYQQCLLGSAFSEDDGDCQGKPSMGQLALYLLALRANCEFVR 390 

SSLQAGTKEDLYLHSLKLGYQQCLLGSAFSEDDGDCQGKPSMGQLALYLLALRANCEFVR 
Sbjct: 61 SSLQAGTKEDLYLHSLKLGYQQCLLGSAFSEDDGDCQGKPSMGQLALYLLALRANCEFVR 120 

Query: 391 GHKGDRLVSQLKWFLEDEKRAI 456 

GHKGDRLVSQLKWFLEDEKRAI 
Sbjct: 121 GHKGDRLVSQLKWFLEDEKRAIGHDHKGHPHTSYYQYGLGILALCLHQKRVHDSVVDKLL 180 

Query: 457 DTAAMAGLAFTCLKRSNFNPGRRQRITMAIRTVREEILKAQTPEGHF 5 97 

DTAAMAGLAFTCLKRSNFNPGRRQRITMAIRTVREEILKAQTPEGHF 
Sbjct: 181 YAVE PFHQGHHSVDTAAMAGLAFTCLKRSN FNPGRRQRITMAIRTVREE I LKAQT PEGHF 240 

Query: 598 GNVYSTPLALQFLMTSPMRGAELGTACLKARVALLASLQDGAFQNALMISQLLPVLNHKT 777 

GNVYSTPLALQFLMTSPMRGAELGTACLKARVALLASLQDGAFQNALMISQLLPVLNHKT 
Sbjct: 241 GNVYSTPLALQFLMTSPMRGAELGTACLKARVALLASLQDGAFQNALMISQLLPVLNHKT 300 

Query: 778 YI DLIFPDCLAPRVMLEPAAETI PQTQEI I SVTLQVLSLLPPYRQS I SVLAGSTVEDVLK 957 

YIDLIFPDCLAPRVMLEPAAETIPQTQEIISVTLQVLSLLPPYRQSISVLAGSTVEDVLK 
Sbjct: 301 YIDLIFPDCLAPRVMLE PAAET I PQTQEI I SVTLQVLSLLPPYRQS I SVLAGSTVEDVLK 360 

Query: 958 KAHELGGFTYETQASLSGPYLTSVMGKAAGEREFWQLLRDPNTPLLQGIADYRPKDGETI 1137 

KAHELGG FTYETQASLSGPYLTSVMGKAAGEREFWQLLRDPNTPLLQGIADYRPKDGETI 
Sbjct: 361 KAHELGGFTYETQASLSGPYLTSVMGKAAGERE FWQLLRDPNTPLLQGIADYRPKDGETI 420 

Query: 1138 ELRLVSW 1158 

ELRLVSW 
Sbjct: 421 ELRLVSW 427 
(SEQ ID NO:6) 

HMM results : 

Model Description „ Score E-value N 

PF01122 Eukaryotic cobalamin-binding protein 829.9 8.6e-246 2 

CE00052 CE00052 lymphocyte_transmeinbrane_protein_KAP 3.2 2.9 1 

Parsed for domains: 

Model Domain seq-f seq-t hmm-f hmm-t score E-value 

CE00052 1/1 1 11 [. 1 11 [. 3.2 2.9 

PF01122 1/2 1 142 [. 1 143 [. 296.0 4.6e-85 

PF01122 2/2 143 376 .] 197 450 .] 531.8 4.8e-156 
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Isoform 2: 

1 MRHLGAFLFL LGVLGALTEM CEIPEMDSHL VEKLGQHLLP WMDRLSLEHL 
51 NPSIYVGLRL SSLQAGTKED LYLHSLMLGY QQCLLGSAFS EDDGDCQGKP 
101 SMGQLALYLL ALRANWHDHK GHPHTSYYQY GLGILALCLH QKRVHDSVVD 
151 KLLYAVEPFH QGHHSVDTAA MAGLAFTCLK RSNFNPGRRQ RITMAIRTVR 
201 EEILKAQTPE GHFGNVYSTP LALQFLMTSP MRGAELGTAC LKARVALLAS 
251 LQDGAFQNAL MISQLLPVLN HKTYIDLIFP DCLAPRVMLE PAAETIPQTQ 
301 EIISVTLQVL SLLPPYRQSI SVLAGSTVED VLKKAHELGG FTYETQASLS 
351 GPYLTSVMGK AAGEREFWQL LRDPNTPLLQ GIADYRPKDG ETIELRLVSW 
(SEQ ID NO: 4) 



FEATURES : 

Functional domains and key regions: 

PDOC00005 PS00005 PKC_PHOSPHO_SITE 
Protein kinase C phosphorylation site 
198-200 TVR 



PDOC00006 PS00006 CK2_PHOSPHO_SITE 
Casein kinase II phosphorylation site 



Number of matches: 7 

1 67-70 TKED 

2 90-93 SEDD 

3 147-150 SVVD 

4 198-201 TVRE 

5 250-253 SLQD 

6 273-276 TYID 

7 326-329 STVE 



PDOC00008 PS00008 MYRISTYL 
N-myr istoylation site 
Number of matches: 7 



12-17 
57-62 
86-91 
173-178 
214-219 
233-238 
254-259 



GVLGAL 
GLRLSS 
GSAFSE 
GLAFTC 
GNVYST 
GAELGT 
GAFQNA 



PDOC00009 PS00009 AMI DAT I ON 
Amidation site 

186-189 PGRR 

PDOC00428 PS004 68 COBALAMIN_BINDING 
Eukaryotic cobalamin-binding proteins signature 
165-178 SVDTAAMAG L AFT C 



SignalP results: 

Measure Position Value Cutoff Conclusion 

max. C 19 0.602 0.37 YES 

max. Y 19 0.702 0.34 YES 

max. S 5 0.974 0.88 YES 

mean S 1-18 0.949 0.48 YES 

Most likely cleavage site between pos. 18 and 19: ALT-EM 



BLAST Alignment to Top Hit: 

>CRA! 108000024636236 /altid-gi t 298316 /def-gb I AAB2552 6. 1 I 

transcobalamin II, TC II [human, endothelial cells, 
Peptide, 427 aa] /org-human /taxon=9606 /dataset=nraa 
/length-427 
Length - 427 
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Score - 793 bits (2026), Expect =0.0 

Identities - 399/427 (93%), Positives = 399/427 (93%), Gaps - 27/427 (6%) 

Query: 1 MRHLGAFLFLLGVLGALTEMCEIPEMDSHLVEKLGQHLLPWMDRLSLEHLNPSIYVGLRL 60 

MRHLGAFLFLLGVLGALTEMCEIPEMDSHLVEKLGQHLLPWMDRLSLEHLNPSIYVGLRL 
Sbjct: 1 MRHLGAFLFLLGVLGALTEMCEIPEMDSHLVEKLGQHLLPWMDRLSLEHLNPSIYVGLRL 60 

Query: 61 SSLQAGTKEDLYLHSLMLGYQQCLLGSAFSEDDGDCQGKPSMGQLALYLLALRAN 115 

SSLQAGTKEDLYLHSL LGYQQCLLGSAFSEDDGDCQGKPSMGQLALYLLALRAN 
Sbjct: 61 SSLQAGTKEDLYLHSLKLGYQQCLLGSAFSEDDGDCQGKPSMGQLALYLLALRANCEFVR 120 

Query: 116 W HDHKGHPHTSYYQYGLGILALCLHQKRVHDSVVDKLL 153 

W HDHKGHPHTSYYQYGLGILALCLHQKRVHDSVVDKLL 
Sbjct: 121 GHKGDRLVSQLKWFLEDEKRAIGHDHKGHPHTSYYQYGLGILALCLHQKRVHDSVVDKLL 180 

Query: 154 YAVEPFHQGHHSVDTAAMAGLAFTCLKRSNFNPGRRQRITMAIRTVREEILKAQTPEGHF 213 

YAVEPFHQGHHSVDTAAMAGLAFTCLKRSNFNPGRRQRITMAIRTVREEILBCAQTPEGHF 
Sbjct: 181 YAVEPFHQGHHSVDTAAMAGLAFTCLKRSNFNPGRRQRITMAIRTVREEILKAQTPEGHF 240 

Query: 214 GNVYSTPLALQFLMTSPMRGAELGTACLKARVALLASLQDGAFQNALMISQLLPVLNHKT 273 

GNVYSTPLALQFLMTSPMRGAELGTACLKARVALLASLQDGAFQNALMISQLLPVLNHKT 
Sbjct: 241 GNVYSTPLALQFLMTSPMRGAELGTACLKARVALLASLQDGAFQNALMISQLLPVLNHKT 300 

Query: 274 YIDLIFPDCLAPRVMLEPAAETIPQTQEIISVTLQVLSLLPPYRQSISVLAGSTVEDVLK 333 

YIDLIFPDCLAPRVMLEPAAETIPQTQEIISVTLQVLSLLPPYRQSISVLAGSTVE DVLK 
Sbjct: 301 YIDLIFPDCLAPRVMLEPAAETIPQTQEIISVTLQVLSLLPPYRQSISVLAGSTVEDVLK 360 

Query: 334 KAHELGGFTYETQASLSGPYLTSVMGKAAGEREFWQLLRDPNTPLLQGIADYRPKDGETI 393 

KAHELGGFTYETQASLSGPYLTSVMGKAAGEREFWQLLRDPNTPLLQGIADYRPKDGETI 
Sbjct: 361 KAHELGGFTYETQASLSGPYLTSVMGKAAGEREFWQLLRDPNTPLLQGIADYRPKDGETI 42 0 

Query: 394 ELRLVSW 400 

ELRLVSW 
Sbjct: 421 ELRLVSW 427 
(SEQ ID NO:7) 



HMM results : 

Model Description Score E-value N 

PF01122 Eukaryotic cobalamin-binding protein 906.3 8.6e-269 2 

CE00052 CE00052 lymphocyte_transmembrane_protein_KAP 3.2 2.9 1 

Parsed for domains : 

Model Domain seq-f seq-t hmm-f hmm-t score E-value 

CE00052 1/1 1 HE- 1 11 E. 3.2 2.9 

PF01122 1/2 1 115 [. 1 H5 [. 241.3 1.4e-68 

PF01122 2/2 117 400 .] 145 450 .] 660.5 8.7e-195 
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1 ATATGTATGG GAAATATGCT GTCTTCCTAT TCCTACTCCC CCACCCTCTA 
51 GCACTGAGTC CAGGTAGGTA GGCAGGGGGG TGTCTCCCTC CTTTACTTCG 
101 ACACCCTAAC TACCTTGGGG ATCAGAAGTG ACTCTCTGGA AGGATGCTGC 
151 TGCTTCTCAC CAGAGGCTGA CGATAACGAA GGCTATCCTC CATGGCCACC 
201 TCCTCCAGGC TGCCTTCCTG GAAATAGGAA TCATAATAGT TGTTACTGGA 
251 AACAGGCAGA GGGTTGGGGG AGCCAAGGCA GTCCCACCCA GGACCAAGGT 
301 GGCTCCATTG CACACACTTC ACCATGACTC CCCTGAAGGT CCAAACGTGC 
351 GGTTCTGCGG AAGTTGGGCT CCCCACTGGC CTCCCTCCTT CCTCAGAACC 
4 01 TCCAGGGGTG CTCCTCCTAG TGGCCACATC CAGCCTTTCT GACTGGACAA 
451 CCTATCATTT AAAATTTTCA AGTAGTTCCG TAAACAGACA CACGTTGCTG 
501 TATTTATTTA TGTCAAGGGC TTGGTTTGTG ATAAGTCAGG CTCAAAAAGA 
551 TTGTCTTAAA AGAGTGAACC TTGGCAATTT AC CAT A A A AT AATTGCAATG 
601 CAGATTGTGC ATGGAAATGA TTGGAGATAT TTTAAGGTCA TAGTGTCTTC 
651 ACAAATTGAG CTGAAAGGGA ACTGTTAGGA TGATCTTGCC TAACCCTCTC 
701 ATCTCACACA GGAAGAACTA TTTTAAACTC GAGAGGTTAA GTGACCTGGC 
7 51 CAAAGTCACA CAGCCACCAC TAGTTAACTC GTATACATTG ATTCTCCTGT 
801 GGGGCTGGGC AGATGAGGAA TCTTTTGTTC TCTTCCCTGT TTGCAGAGAT 
851 TTTTTTTGAG GTTACTTTCC GAGTTCTGGC AAGTACCCCT GCTTCTGGTA 
901 GCTTTGTGTC TCGATTCAAT CTCATTCTTT TTATTTTATT TTATTTTTGA 
951 GACAGGGTCT CACTTTGTCA CCCAAGCTGG AGTGCAGTGG TGTAATCTTG 
1001 GCTCACTGTA GCCTCCACCT CTTGGGTTCA AGCGATCCTC CTGCCTCAGC 
1051 CCCCCAAGTA GCTGGGATTA CAGACGTCTG CCACCACGCC AGGCTAATTT 
1101 ATGGTTTTTT GTATGTGTTT TTTGTGTTTT TGTAGAGACA GTGTTTCCCC 
1151 ATGTTGCCCA GGCTGGTCTC CAACTCCTGA GCTCAAGTGA TCTGCCCGCC 
1201 TCAGCCTTTC AAAGTGCTAG GATTACAGGT GTGAGCCACC GTGCCCGGAC 
1251 TTAATCCCAT TCTTTAACTT GTTTTGTTTT GTCCTCTCCA GGAGGCTCCC 
1301 AGCCCTTTCG GATTGGTTGA GAAAAGTGGC CTGGCTGGTC TGGGGCCAGC 
1351 AGCACCCACC CTCCCCTCAA TTGCCCAACT CCCCCCCCCA CCGAACTGCC 
1401 CAACTCCCCC TCCCCAACTG CCCAACTCCC CCACCCCCAC AATCCCCTCC 
1451 CGCCACAACT GAGGGAGGCG GTGCTGAAAA ACAGCTGACT CCAGCAATGC 
1501 TGCTCACGTG ACCACTGCAG CTGCAGCTCC CGTTCCACTC CTTGTCCTGG 
1551 GCTAGGTGGG CACTACCAGG GGCTCCTTTG GTAAGGAGTA CCGGGTAGGC 
1601 ACCCGGTCCT GCCAATCCAC CACTGGAACA GCTGGGGGGA CAGCAGACAG 
1651 GCACGGTCGG ACAGACTTGA CAGATCAGGC ATCAGGCCCT CTGCGCTGGT 
1701 CCCGGGCTCT TTAAGCAGGA ACGTGAATGG CCTCAAGATG TCTCACATGG 
1751 TCCCACTAGC CCTCCTCCTC CCTTTGTTCC CTACCTCCAG GAGGGCTGCT 
1801 CTGCCCTTCC TTCCTCTGTT CTTTGGCCTT ATGTTCCCCG CCACCACAGG 
1851 CCTTCCCCCG CCCCACCCCT CTGCAGACTT AGCCGTGCAT TGCAGGCATG 
1901 GAGGATTAAT CAGTGACAGG AAGCTGCGTC TCTCGGAGCG GTGACCAGCT 
1951 GTGGTCAGGA GAGCCTCAGC AGGGCCAGCC CCAGGAGTCT TTCCCGATTC 
2001 TTGCTCACTG CTCACCCACC TGCTGCTGCC ATGAGGCACC TTGGGGCCTT 
2051 CCTCTTCCTT CTGGGGGTCC TGGGGGCCCT CACTGAGATG TGTGGTGAGT 
2101 AACTCGCCTC TATCCTGTGC CTCTTTCCTC CTGGGTCCTT AGTGGGGTGG 
2151 CTAGGGCATA GGATGAGGGA ACTTACCTGC CCTTCTAAGC TCCCATAGCA 
2201 GTTTGGGCTT AGCTGGACCT CAGCATTTAA CACATCCTAT TGTGATTGAT 
2251 TATATGTTTG ACTCCTCACC AGACAAGATC TCCGTTAATT CAGTCATTCG 
2301 TTCACACATT CATTCAGCGC ATACTGAGCC TTTTCTGTGT CAGGCCCAGT 
2351 GTTAGCCTTT GGGGAACGTG CAAAGCATGA GACAAGTCTA ATCCCTGCCA 
2 401 TCCTAGAGCT TATGTTCTAG GGAAGGGGGA CAGACAAAAG AAATGGTTAG 
2451 GTGCTCCCAC CTGAAATCTC AGCATTTTGG AAGGCTGAGG CGGGAGGGGA 
2501 GGATCGCTTG AGCTCAACAG TTCAAGGTCA GCCTGGGCAA CATAGGGAGA 
2551 CCCCATCTCT ACAAAAAATA AAAAAAATTA AAAAATAGCT GGGCATGGGG 
2601 AAGACTTTCT GAAGACCAAG AGGACACATG GGAGCTGAAA CTCGAAGGAA 
2 651 GAAAAGGAGC TGGCAGGAAA GGAGTGGGGG ACACACATTC TAGGCAGCAG 
2701 GAAGTGAGCC TTCGGAGGTC CTGCCTGCTC CAGCTCTGTG CCCCAAGGGG 
2751 TCTCTTGGAG CACAGTCTCC TGGGACCTGT CTATGAGTCT GAGCTTAGAG 
2801 GCTCAGGGCT GCTCCTTCAG ACAGGAGGCA GAAGGCAGAC TTTGGGAACT 
2851 TTGGGCCGCC CACGCGCCTT TTCTCCTCCT CTGCACCTAG GATTACGTTG 
2901 AGCAATACAC TTTCACCCCC ATGGTCTCTT GAGACCCTGG GGAAACCCTG 
2951 AGAGGTGGGT GCAGTCATGT CCAGGTGTCA AGTGAAGAAG TCGAGGGTTG 
3001 GAGGGGCTGA GTGACCCACT CAGGGTGCTC CACCTTTTCC AGAGCTTTGC 
3051 TGAACTTAGT TTTTAGAACT TGAAGCCTCG TTTGTTTTCG TTTTGTTTTT 
3101 TGTTGAGAGA GGTTCTCCCT CTGTTGCCCA GGCTGGAGTG CAGTGGCACG 
3151 ATCTTGGCTC ACTGCAGCCT CTGCCTTGTG GGTTCAAGTG ATTCCCCCAC 
3201 CTCAGCCTCC CAAGTAGCTG GAGACTGCAT GTGCATACTA CCATGCTTGG 
3251 CTAATTTTTG TATTTTTTTG TAGAGACAGG GTTTCGCCAT GTTGCCCAGG 
3301 CTGGTCTCGA ACTCCTGGGC TCAAGTGAAA CTCTTGCCTC GGCCTCCCAA 
3351 ATTGCTGAGA TTACAGGCGT GAGCCACCGT GCCCGGCCAG AACTCCAAGC 
34 01 CTCTCATCTG TGTTCCATAA ATGCAATCAG ACACCTCAGG TCTGGGCCCA 
3451 GGAACCCCAG CTCTTGGTTC ATGTCCGGAC AGTCCCCAGG GGAGTTCTGG 
3501 GTTCAACCAG CAAGAGCTCT TCCTCCTGGC TGATCTGGTC CTCAGCCTTG 
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3551 GACAGTTAGT CCATTAACCT GACCCCACAG GAGCCCCAAT CCCTTGGGGT 
3601 CTGGGGAATC TTGAACTGGG GTTTGGGGTG CAAATATCTG CACTGAGTCA 
3651 CTTAATTGCA CCCAGCCTCA TTCCTTTATC TGTAAAGTGG GCTAAGAATG 
3701 CTCCCCTGCC TTCCTCCTCG GTGTAGTACG AGGAAGGATC CCATGACACC 
3751 TGCTCTCCCA GTTTAAAGCT CTATATGTAT GTTGTGAAAT TGACAGGGAT 
3801 CGCTGCACAA ACGCTAATGC AAAGTGGGCT CCTGTGCTTC CTTTTCTCTT 
3851 TCTTCTTCTT TTTTTTTTTT TTAATTTTCT TCTAGAGATG AGGTCTCACT 
3901 ATATTGCCCA GGGTTGGTTT CAAACTCCTA GGGTCAAGCG ATCCTCCCAC 
3951 CTTGGCCTCC CAAACTGCTG GTATTACAGG CGTGAGCCAC TCTGTCTGGC 
4001 TCCTATGCTT GTGAATGTCA ACAGCAATCA GCCCTTAGCT GGCAGGGCTG 
4051 GGTTGGTAGG GCGAGAGCTC ACCCAAGGCT GCTTTTATTA CCCTGCGTGA 
4101 ATCTGCCTGG CCCCTTCCTT CTAAGGAGGT TGCTCTGTGG TTGTCAGTCT 
4151 CTCCCTTTAC AGCTGGATCC TGATCTTTCA GTTTCTAACC CTGTGCTGAC 
4201 TCATCGTGCT GGAAGTGAGA GCCCGGGGTG AGGTCAGGGA ACTCCCTTGC 
4251 GCGTTTCAAG AAAAGGGAAA AGGAAAGAGA GGTGAGGAGG GGGGCAGATG 
4301 AC C AG AG AG A CACAGGCTGA GAGAGACTGA GACAGACCCA GAGAGCCTCA 
4351 CACATTGAGT GACAGAGACG GAGAAATGGA GATAGGCACC AAAAAATGGT 
4401 TCTCAGTGAC AGAAAGGGAA AAAAGCAACC CCCCAGTCTC TCTTAACATC 
4451 TGGTGAGAAA CCAGCCATGT GCTTTGGTCT GGGCCCACAC AGCAAAGGAT 
4501 TATGTAGGGT TTCATGCTGG TGGATGGTCA CCTTATAGCA ACAGGTATCT 
4551 GGGGCTGTCG GGAAAACAGA CACGAGGTTG TGGGACCCAG ACCCACAGAG 
4 601 ATGGAGCTGT TCTAGGAGCT CTGGTCCTCG TTCTGGTCCC CTGGGATATG 
4 651 GCACAGTGAA GGCCACCATC AGGCAGCTGG AGCCCAGCAG CAACTGGGAG 
4701 GCAGTAAACA GGGACCGAAA GTGCAAGGTT ACCTCCGAGG CAAACTACTC 
4751 TAAGCTACCC TGTGCTGAGC TCAAGTCCCT TGGAACTATC CCTAAGGCTT 
4801 CCGCTTCCAG AGTGTTTGAG TATTTTCGTT GCACAGCTTC GAATAAATCC 
4851 CACAGCAACA GGTAAACGGC TGCAAGCTGT GACTGTTTTC TAAGAGCTCA 
4901 TCTCACAATC TCAGGTCCTC TTCATTTAAA CAGAGATGGC AGGAAAGGCG 
4951 TTATTTTGAG ATCTGCATGG AGGAAGTTCA CCAGGCAGCC TCAATTCACC 
5001 AGCTGGAAGT TTGCGTTGTT TGGAAATTTG ATGTGTAACA CGTTCTGCAT 
5051 GTGGGCTGAT GTTTTTGTAA ACGGGTAGCA CACACATTCA GCAGGGCACC 
5101 AAAGAGCGGG GGCTTTGCAG TTAGGTCCAT CCTTGGCTCT GCAGCCTTGT 
5151 GTAAGACATG ACACGACTTT GAACTTCTGT TTCCTCTTCT GTGCAAAGCA 
5201 ATGATGACAG TATCTACATC ACAGGACTGG CAT GAG G AC C AAGTGAGATT 
5251 GGGCAAGGTG CCCGGGCACA CCAGTCTCAC TGTCACTGCT GATGGGCAGA 
5301 GTGGTTGCCT GGCAGTAGCA TCCTCTATCT TCAGCCCACC ACCTCTCTTG 
5351 CTGGCTCACT CCAACTGCTC TTTAGAGATA CACGCTTCCC CTCTTTTCTC 
5401 CTCCCACTGC CTTTCAGTAT GGCTGCATTT CCCCCTGCAA GTTGGTGTGT 
5451 GCTGGGTGGA GGTGGGGGTG AGGACATGTA TTCTCTGGAG AAGGCCCTGG 
5501 TAACGTCAAA GCACTTCTTT GCTGGTGGCC TGGCCCTGTG ACCTCATTTG 
5551 TACCATTTTC TTTTCTAAGA AATACCAGAG ATGGACAGCC ATCTGGTAGA 
5601 GAAGTTGGGC CAGCACCTCT TACCTTGGAT GGACCGGCTT TCCCTGGAGC 
5651 ACTTGAACCC CAGCATCTAT GTGGGCCTAC GCCTCTCCAG TCTGCAGGCT 
5701 GGGACCAAGG AAGACCTCTA CCTGCACAGC CTCAAGCTTG GTTACCAGCA 
5751 GTGCCTCCTA GGGTATTGCC ACACTCTCTT TTTCCATGTC TTGCTCCACA 
5801 T AC T A AG AG A TGGGAAACTT GGGTACTAGT TTGGGCCTGT CACCACTTTG 
5851 TGGGCAGACC TTAGGCAAAT TTTCTCCATC TATAGAATGG AGGACCTTTG 
5901 TCCATCTATA GAATGAAGGG GTTGGTTGGA TTAGATCAGA GATGCTAATG 
5951 CAAGGCTCCT TTTGCTACTA CTGTCCATCA TGTGTCTGAG GCAGACATAA 
6001 CTAATCCGTG ACTATACTCT TTGATGATGA GCCCAGGAGC AGCATCTGAC 
6051 TCTATGCTCC CTTAGTGTGC CTGAGGCAGA TATCACTAAT CGATGACTGC 
6101 AGTCTTCTAC ATTGAGCTTA GAAGCAGCAT CTGACTCTGT ATGCTCCCCT 
6151 CCCATGCATG AGGCAGACAT CAGTAATCCA TGACCGCATT CTTTCATACT 
6201 GAGCCCAGAA GCAGCATCTT TTCTTTTCTT TCCTCTCACT CTGTTGCCCA 
6251 GGCTAGAGTG CAGTGGCACA ATCTTGGCTT GCCCCAACCT CCAATTCCCG 
6301 GGTTCAAGTG ATTCTCGTGC CTCAGCCACC TGAATAGCTG GGATTACAGG 
6351 CGTGTGCCAC CATGCCCAGC TGATTTTTGT ATTTTTGGTA GAGATAGGGT 
6401 TTCACCATGT TGGCCAGGCT GGTCTTGAAC TCCTGACCTC AGGTGATCCG 
6451 CCTGTCTTGG CTTCCCAAAG TGTTGGGATT ATAGGCATGA GCCACTGCAC 
6501 CAATCCAAAA GCAGCATCTT TGTGCTCCCT TTTCAAGAGG CATCACAGAG 
6551 AGGCCTGTTT TGGGGTTTGA ATGAGAGGCG AAGAATCAGC CATGGAGTGC 
6601 CTCTTTCTCA GACTCCCTCT TGAGAAGTGG GTGCAGGGGT G GAG AG AAA A 
6651 G A AG AC TAG G CATAGTGGCT CATACCTGTA ATCCCAACAT TTTGGGAGGC 
6701 TGAGGCAGGA AGATTGCTTG AGCTCAGGAG TTTGAGACCA GCCTAGGCAA 
6751 CATAGTGAGA CCACATCTCT T AAA AAA A AG AAAAAGAAAA AAAATGAGCC 
6801 AGGTGTAGTG ACTCATGCCT GTGGTCCCCA CTTCTCCGGA GGCAAAGGTG 
6851 GGAGGATCTT TTGAGGCTGA GAAATCGAGG CTACAGTGAG CCATGGTGGC 
6901 ACCACTGCAC TCCAGCCTGG GAGACAGAGA GACCCTATCT CAGTAAAAAA 
6951 AAAAAATAAA AATATGGCTG GGTGTGGTGG CTCACGCCTG TAATCCCAGC 
7 001 ACTTTGGGAG GCCAAGGTAG GTAGATCACA TGAGGTTAGG AGTTCGAAAC 
7051 CAGTCTGGCC AACATAGTGA AACCCTGTCT CTACTGAAAA TACAAAAAAT 



FIGURE 3, page 2 



7101 TAGCCAAGGG TGGTGGTGGG CAACTGTAAT CCCAGCTACT TGGGAGGCCG 
7151 AGGCAGAAGA ATCGCTTGAA CTCGGGAGGC GGAGGTTGCA GTGAGCTGAG 
7 201 AACATGCCAC TGCACTCCAG CCTGGGCAAC AAGAGCGAAA CTCTGTCTCA 
7251 AAGAAAATAA ATAAATAAAA TAAAAAAATA AAAAAGGAGG GGGCATATGG 
7301 GTGAAGTATG GACAAAATAG TGGGGCAGGC ACAGATGATC TGGACACAGG 
7 351 AGCCCTTGGA GTTTATTCTT GAATCTAACT GTTCATCTTT ATTAAATATT 
7401 TGTGGCATAC ACCTCACAAC AACATAGCCA ACACACCTCC TTTTGGAGCT 
7451 TTTATCGAAG TTTCCCACTG TTAAGATTTT TTCCCGCTTT GTGATGCGGG 
7501 TGGGGTGGGT GCTGTAAGCA GGCTTACGGG GTGGCAGTTT CTCACAAAGG 
7551 CATTAACTGG CCTTGTCCTA GGTCTGCCTT CAGCGAGGAT GACGGTGACT 
7 601 GCCAGGGCAA GCCTTCCATG GGCCAGCTGG CCCTCTACCT GCTCGCTCTC 
7 651 AGAGCCAACT GTGAGTTTGT CAGGGGCCAC AAGGGGGACA GGCTGGTCTC 
7701 ACAGCTCAAA TGGTTCCTGG AGGATGAGAA GAGAGCCATT GGTGAGCAGA 
7751 CACCATCCGC TGGGGGTGGG GAGCAGCTGG GAGGGCTCAT CAGATGATAT 
7801 TCTCCAATGA GAATCAGAAC TTTGGGTTTT CTCCCCAGGC GTCTTTCCCA 
7851 CCATCCATTC TGCCCATCTC ACTGCCTACG TAGAGGCTCG AACCTGTCCC 
7901 CATAGCCATC CTTGACCCAG CTTTTCCCGC GCTGCACACA TACTATTGAC 

7 951 AGGTGTGTTT CGTGGTTTTT TGTTTTTTGT TTGTTTGTTT GTTTTGAGTT 
8001 GGAGGTTTGC TCTTGCTGCC CAGGCTGGAG TACAATGGCG CAATCTCAGC 
8051 TCACCGCAAT CTCTGCCTCC TGGGTTCAAG CAATTCTCTT GCCTCAGCCT 
8101 CCTGAGTAGC TGGGATTACA GGCATGCGCC ACCACACCCA GCTAATTTTG 
8151 TATTTTTAGT AGACGTGGGG TTTCTCCATG TTGGTCAGGC TGGTCTCGAA 
8201 CTCCTGACCT CAGGTGATCC GCTTGCCTTA GCCTCCGAAA GTGCTGGGAT 
8251 TACAGGCATG AGCCACTGCG TTAGGCCCAC TGACAAGCCT TGTATTGGCT 
8301 AGCCACCAAG ATTGACTTGA TTATCCACCT TCGGGACAAC TGGACAGCCT 
8351 GCTTATGACT TACGCCATAG TCTGTCTCTA CTAGCTCTCC TGCCCTGACT 
84 01 TGACCCAGCA TACAACAGCC AGAGCCAGCC TTTTCAATAT AAACCTGATC 
8451 TTGCTGGCAC TGCTTAAACC CTGCAGGGGC CTCGCACTGC TCCATGGCCC 
8501 AGCCTGTCTA CCCTTACCTT CTGCCCAGGC TCTGCTCATC CATTCTCTGC 
8551 CTCCCACACA CCTGCCCTCT GTGGGCTCCA GC CAT AC CAT CTCTCAACTC 
8601 ATAAGCCAGT TTTTTCATAC AGGCTCCCTC CATCTGGACT GGCTTCCCTG 
8651 CGTGCAGTTC ACTCCTGCTC TACCTTTGGC TCTGCCTCCA CCCATCCTCA 
8701 GCCGTCTCCA GCATTACCTC CTTGGAGAAT CCTGCCTTGA CTTCCCAGCC 
8751 ACCCAAATAT CACTACTTGG TCTGCATTCT CGTTGCAATT GCAGTCGCAT 
88 01 GAGCAATTGC TGTGGTTGAG GCCCGAACTG CGCAAGTGCC TGTCTGCCAT 
8851 GGGTCTCCTG CTTCCTCTAA GCACAGTGCC TGACACACAG TGAGACCTCA 
8901 GCACGTATGG GCTGAGGCAA TGAAGGAATG AAGGATCCCA TGACCCAAAA 

8 951 GAGCCTGTTG GAAAGTGCAG GCCAGGGTCC CAGGTGCTGG CGGGGCTGGC 
9001 TGCTGGGTGG GGGCAGAGAG GCAACCCCTC TGTTTTTTTC CCTCTCAGGG 
9051 CATGATCACA AGGGCCACCC CCACACTAGC TACTACCAGT ATGGCCTGGG 
9101 CATTCTGGCC CTGTGTCTCC ACCAGAAGCG GGTCCATGAC AGCGTGGTGG 
9151 ACAAACTTCT GTATGCTGTG GAACCTTTCC ACCAGGGCCA CCATTCTGTG 
9201 GGTGAGTAGG TCAGACCGTG CCAAGGCCAG GCTGGCACTC CCTCAGTCCC 
9251 CAGGTCTGCA CTGATGACGT CCATACCCTG GCCCCCACAC TCACCTTTCC 
9301 TTGGGGCTCC TCCGAATCAA GTCCTTTAGG GACGAATTGG CGAGGGCTCA 
9351 TGGGTGATGC TCCAGCTGTG AGCCAGCTTT GGAGCTGGTA GGTGGATCTC 
9401 TTGAGGCCAG GAGTTCAAGA CAACGTGGTG AAACCCCATC TCTACTAAAA 
9451 ATAAAAAAGT TAGCCGGGCA TGGTGGCACA TGCCTGTAGT CCCAGCTACT 
9501 CGGGAGGCTG AGGCAGGAGA ATCACTTGAA CCTGGGAGGC GGAGGCTGCA 
9551 GTGAGTGGAG ATCGCACCAC TGCCCTCCAG CCTGGGCAAC AGAGTGAGTG 
9601 AGACTCTGTC TCAAAAAATA AAAAATAAAA TAAAACTCCC CTAGTGATTC 
9651 CAATGTGCAG CTAAGTTTGG AAATAGGTGG TATGGGGTCA AGTCCTCTTG 
9701 GGCCTCCCTC CTCCAGTCCT TCTCCCTAAC CTCTAGCCCT CAAGTTGCAG 
97 51 AGTGATCAGC CAAACCAGTT TGCCCAGAAA TGAGCAGTTT CCTGGGACAC 
9801 AGGATTTTCA GAGTCCAGAC AAGGAAAGTC TTGGGCAGAC CAGGTTGAGT 
9851 TGGTGCCCTT AGCTGATCTG ACCATGTTGC CCTTCTTCTC CAAGCCCTCC 
9901 TGTGGTTGTC CATAGCTACA AGGGCCTGAC CCTCAAGCCC CTGCCTGTCC 
9951 TGGCCCCTTT GGCTCTCCAG CTCATTGCAT GTTCTGTCCC CCACTTCAAG 

10001 ACACAGCAGC CATGGCAGGC TTGGCATTCA CCTGTCTGAA GCGCTCAAAC 
10051 TTCAACCCTG GTCGGAGACA ACGGATCACC ATGGCCATCA GAACAGTGCG 
10101 AGAGGAGATC TTGAAGGCCC AGACCCCCGA GGGCCACTTT GGGAATGTCT 
10151 ACAGCACCCC ATTGGCATTA CAGGTGGGAA AGAGACCCTG GAGCCATGGC 
10201 CACCCTGGGG AACAGTCGGG TGGAGTGGTC AGGTGCTGGA ACACCTAGCC 
10251 CCTCCCTGCC GGCTGACCTC CTCTCTCTCT TCCTCACTCT ATCACCAGTT 
10301 CCTCATGACT TCCCCCATGC CTGGGGCAGA ACTGGGAACA GCATGTCTCA 
10351 AGGCGAGGGT TGCTTTGCTG GCCAGTCTGC AGGATGGAGC CTTCCAGAAT 
10401 GCTCTCATGA TTTCCCAGCT GCTGCCCGTT CTGAACCACA AGACCTACAT 
10451 TGATCTGATC TTCCCAGACT GTCTGGCACC ACGAGGTAGC CCAACTTTTT 
10501 GTGGAAGCAC AGCCCTTTAC AATCTGCTGC GCACCCATTG ACGTCCCAGT 
10551 GAGGGGAGGT TGCTTCATCC TGATTTGCTG AGTCAGCACA AGTTTGTGGG 
10601 TGTGCATGGG ACACAGTAGC CAAAATGTGG TCATAGCTTC TAGAAGCTCA 
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10651 CAGTGTGGGG AGGAAGACAG TAAATGGAGA TCCCTGGGCA TATCGCTTGT 
10701 GTGATACCCA GTACAGAAAT GTTTGGATGG ATGGATGGAT GGATGGATGG 
10751 ATGGATGGAT GGATGGATGG AT GAG GAG AG ACACATTTTG GTTAACTCTA 
10801 ATACAACATG ATAAGCCCCA GTAGCAGCAT GATCCAGGCT TTCTCTGAGA 
10851 GAGGGTCTGA GGACGTGACT GGGATTTGCC AATTAAGAAT GGAGAAAGAG 
10901 GCCAGGTGCA GTGACTCATG CCTGTAATCC CAACACTTTG GGAGGCCGAG 
10951 GCGGGTGGCT CACCTGAGGT CAGGAGTTCG AGACCAGCCT GGCTAACATG 
11001 GCGAAACTCC AT CT ATT AAA AATACAAAAA AGTAGCTGGG TGTGGTGGCG 
11051 AGTGCCTGTA ACCCCAGCTA AGCTACTCAG GAGGCTGAGG CAAGAGAATC 
11101 ACTTGAACCT CAGAGGTGGA GGTTGCAGTG AGCCAAGATC ATGCCACTGC 
11151 ACTCCAGTCT GGGTGACAGA GTAAGACTAT GTCTCAAAAA AAAAAAAAAA 
11201 AAATGGAGAA GAAGGAAGCT GGACATGGTG GCTCGTGCTT ATAATCCTAG 
11251 CACTCTGGGA AGCTGAGGCA GATGGATTGC CTGAGCCCAG GAGTTTGAGA 
11301 CCAGCCTGGG CAACATGGTG AAACCCTGTC TTTACTAAAA TACGAAAGAT 
11351 TAGCCAGGCA TGGTGGTAGA CACCTATAAT CCCAGCTACT AGGGAGGCTG 
11401 AGCCACAAGA ATCACTTGAA CCTGGGAGAC AGAGGTTGCA GTGAGCCGAG 
11451 ATCGCGCCAT TGCACTCCAG CCTGGGCGAC AGTGTGAGAC TCTGTCTCCA 
11501 GAAAAAACAA GAATGGATAG AGTGGAGCCA AGAAGAGGCA GGAAGAACAA 
11551 AGACACAGAG GTGCACAGAG TTTGGGGGAA TTTTGAGGAA TGGTCTTGCA 
11601 AAAGAGTGGG ATCTGGGAGA ATGAGTGGGA GTGGAAAGCA GATGAATGAA 
11651 GAGAAGGTGA GCGCATCAGG GTAACAGAGA TGCGTTGTGA ACAAATGCAT 
11701 GTTCTAGGAA GAGCCCTCTG GAGTGCTAGG TGCCAGAGAG GTGGGAGGAA 
11751 GGATACTGGA AG C AG AG AAA CCAGTGAGGG GCCTGATCTT GGGTGGTGGG 
11801 GAATGAGGGA CAGGGGAGGC CGGGATGGAA GCCAGGTGGT GGGGAATGAG 
11851 GGACAGGGGA GGCCGGGATG GAAGCCAGGT TTCAGCTGAG CAGGTGGCGG 
11901 TGGCATTGAT GGAGATGAGG ACATGGGGAA GGACAAAGTC CAGGTGTCCT 
11951 TGAGGGAAGA CAAGAAGACA AATAATCCAG GCTCTCTGTC CTCACACCAG 
12001 CTGCCCGCCC CTTTCTTCCT GGCACAGTCA TGTTGGAACC AGCTGCTGAG 
12051 ACCATTCCTC AGACCCAAGA GATCATCAGT GTCACGCTGC AGGTGCTTAG 
12101 TCTCTTGCCG CCGTACAGAC AGTCCATCTC TGTTCTGGCC GGGTCCACCG 
12151 TGGAAGATGT CCTGAAGAAG GCCCATGAGT TAGGAGGATT CACGTGAGAC 
12201 TCCCACCTCC CAGTCCTCAC CCCACCCAAC CTCACATGCC TGATAACAGG 
12251 GTCACAGAAA AGACGGGGAA CAGAGGAGAG GGTTCCCTCG GGAGAGACAC 
12301 TGGCCCTGCT TCTGCTTCTA CCTGCTCAGC TCCTTTCTTG CCCACGGTGT 
12351 TATGGAAACA GGGAGCCATA GGCCAGCATT GTCACTGAGA GAGCAGGCTT 
12401 TGGAGGCAGA GCCCCCCAGT TGGAATCCCA ACTCTAACCA GCTAGGTTCC 
12451 AGGTAGGCAC CCACAATTCA C C GAG GAG A A CAGTTGTGCC CCTTCCCTGC 
12501 AGGGCCAGTG TGAAGAGTCC AGGAGTTAGT ACACATAGAG ATAGTGGCAT 
12551 GTGCTTTTTA TATGTGCAAG GTCCAGCACA TAGCAAGCGC TCAACACAGC 
12601 GTTGCTTTCA TCAGAGTAAG AACTGTTTTT TGTTTGTTTG TTTGTTTGTT 
12651 TTTAAGAGAC AGGGTCTCAA TCTTATCACC CAGGCTGGAG TGTAATTGTG 
12701 CAATCACGTC TCACTGCAGT CTCGAACTCT GGGGATGAAG CAACCCTACT 
12751 GTCCTGCCTC AGCCTCCCAA ATAGCTGAGA CTATAGGCAC GTGCCACACA 
12801 ACCCTGGGTA ATTTTTTTTT TTTTTTTTTT GAGATAGGGT CTCTGTCTGT 
12851 TGCCCAGGCT GGTCTCAAAT TCCTGGCCTC AAACCATCCT CACACCTGAG 
12901 GCGCTCAAAA TATTGGGATT ATAGGTGCGA GCCATCATGC TCAGCCAGAA 
12951 TAATAACTGG TTTTTTTTGT TTTTTTTTTG AGACAGAGTC TCACTCTATT 
13001 ACCCAGGCTC TGGAGGCCCA ACTCGTGTTT GTGTATTTGT TTATTTTTAT 
13051 TTATTTATTT ATTTCGAGAC AGAGCCTCTC TCTTTCACCT AGGCTGGAGT 
13101 GCAGTGGCGC AATCTCGGCT CACTGCAACC TCCGTCTCCT GGGTTCAAGT 
13151 GATTGTCCTG CCTCAGCCTC CTGAGTAGCT GGTGCTACAG GCGCGTGCCA 
13201 CCATGCCCAG CTAATTTTTG TATTTTTAGT AGAGACAGGG TTTTACTATG 
13251 TTGGCCAGCT GGTTTCTAAC TCCTGAACTC GGGTGATCTG CCTGCCTCGG 
13301 CCTCCCAAAG TGCTGGGATT ACAGGCATGG GCCTCCGTGC CCGGCCATGT 
13351 ATTTATTTAG GCAAGGTCTC TCTCTGTTAT CCAGGCTGAA GTGCAGTGGC 
13401 ACATTCATAG CTCACTGCAG CCTCAAATTA TCCAAGTAAC AGGGACTACA 
13451 GGCATGCACC ACCACACCCA TCTACTTTTT TTTGAGATGG AGTCTCCCTC 
13501 TGTCGCCCAG ACTGGGTTGC AGTGGCACAA TTTCAGCTCA TGGCAGCATC 
13551 TACCTCCCAG GTTCAAGCGA TTCTCCTTCC TCAGTCTCCC GAGTAGCTGG 
13601 GACTATGGGC ATGCACCACC ATACCTGGCT AATGTTTATA TTTTGAGTAG 
13651 AGATGGAATT TTGCCATTTT GGCCAGGCTG GTCTTGAGCT CTTGACCTCA 
13701 AGTGATATGT CTGCCTCAGN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
13751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
13801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
13851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

13 901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
13951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

14 001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
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14201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
144 01 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14 601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14 651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14 701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
147 51 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14 901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14 951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
15001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
15051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
15101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
15151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
15201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
15251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
15301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
15351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
154 01 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
15451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
15501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
15551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
15601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
15651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
15701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
15751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
15801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
15851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
15901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
15951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
16001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
16051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
16101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
16151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
16201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
16251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
16301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
16351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
16401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
16451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
16501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
16551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
16601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
16651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
16701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
16751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
16801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
16851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
16901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
16951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
17001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
17051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
17101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
17151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
17201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
17251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
17301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
17351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
17 401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
17451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
17501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
17551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
17 601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
17 651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
17701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
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17751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
17801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
17851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
17 901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

17 951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
18001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
18051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
18101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
18151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
18201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
18251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
18301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
18351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
18401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
18451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
18501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
18551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

18 601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
18651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
18701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
18751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
18801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
18851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
18901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
18951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
20001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
20051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
20101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
20151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNCCAAATC 
20201 AACCAGTTGC ATAAATCACT CCTCTATCTT CCTTGGGGTG GAAAGTGGAT 
20251 GGGAGTTATA ATTTGAGTTC TCTTTTGTCT TAGTCCATTG AAGCTGCTAT 
20301 T AC A A A AT AC CATAAACTGG GTGGCTTATA AACAGCAGAA ATGAGGCCGG 
20351 GTGCGGTGGC TCATGCCTAT AATTCCAGCA CTTTGGGAGG CCAAGGCAGG 
20401 TGGATCACCT GAGATCAGTA GTTCAAGACT AGCCTGACCA ACATGGTGAA 
2 0451 ACCCTGTCTC TACTAAAAAT ACAAAAAATT AGCTGGGGGT GGTGGCGGGC 
20501 ACCTGTAATC CCAGCTACTC AGGAGGCTGA GGCAGGAGAA TCGCTTGAAC 
20551 CCAGGAGGCG GAGGTTGCCG TGAGCTGAGA TCACGCCATT GCATTTCAGC 
20601 CTGGGCACAA AGAGTGAAAC TCCATCTCAA AATGAAATAA AATAACAGAA 
20651 ATGTATTTCT TAACAGTTCT GGAGGTTGGG TGGGCAGTCC CAGATCAGGA 
20701 CACTGACAGA TTCAGTGTCT GATGGGGGCC CACTTTCTGG TGTTACCTGC 
20751 TGGCTGTGTT CTCACATGGT GGAAGGAACA TGGCAACTTT CTGGGGCCTT 
20801 GTTTTTTAAT TTAAAAAAAA AAAATATTTT CCTGGCCCTT GCCTGCTGAA 
20851 GGAACCTCTT TTATAATGGT ACTTAAAAAT TTTTTTTTTT GAGATGGGGG 
20901 TCTCACTCTG TCACCCACGC TGAGTGCAGT ATCACAATCT CAGCTCACTG 
20951 CAACCTCTGC CTCCCTGGCT TAAGCGATCC TCCCACCTCA GCCTCCTGAG 
21001 TACGTGTGAC CATAGGCCCA TGGCACAAAG CCCAGCTAAT TTTTTGTATT 
21051 TTTAGTAGAA ATGTGGTTTC ACCATGTTGC ATAGGCTGGT CTCGAACTTC 
21101 TGAACTCAAG TGATCTGCCT GCCTTGGCCT CCCAAAGTGC TGGGATTCTA 
21151 GGTATGAGCC ACCCTGCTCG GCCTATAATG GCACTTTCCT ATCCCATTGA 
21201 TGAGGCTCTA CTCTCATGAC CTAATCATCT CCCAAAGGCC CTAAGGCCTC 
21251 CTGATACCAT CACCTTTGGG GTTAGGTTTT AACATATACA TTTTGGGGGG 
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21301 ACACAGACAT TTTAGACCAT AGCACCTCCA TTGAAAGGAA ACATTTCTGA 
21351 CACCTGGCTA TCTCAAAGGG CCCTTTCAGT TCCCCTGCAG GCTGCATTCC 
21401 CACATCACCA ACAAGAGCAG CGACACTCAC TCAGAGGTTA AATAACTTGT 
21451 CCAGAGTCAC AGCAGTAATG A AT G AC AG AG CTGGGGCTTG AATCCAGGCG 
21501 TCCTCCTAGA GCCTGGATTC TGTGTAGTGA GTGAAAGCTG ACTCCTGGGA 
21551 GACTTCTGCG TGGTCCTGGT TCTCTCTCCA GACTGCACTG CGCAAGTTTC 
21601 TCTTCCTGAT GGTCCCTAGG GTATTACAAA GACAGTGGCC CTGCCTGTCA 
21651 GGTGTTTTTA TTACCAGATG AGGTCATGGC CTCAGGAACC CTGTAGGAAG 
21701 CTGAGTTCAG AGTCTTTGAG CAGGCTTTAG GGAGGTTCCA GCTTCCCACC 
21751 ACCAAGCCCC AGGTGGATTC TTACAGACTC TAGCCTCAGG GTGGGGGGTC 
21801 TGGAAGATGA GGTTGCGGGG TGCGATATTC TGCCCAATTC GCCCCTCCTT 
21851 GCTCAATCTG TTTCTGCAGG TATTGCTGAC TACAGACCCA AG GAT GG AG A 
21901 AACCATTGAG CTGAGGCTGG TTAGCTGGTA GCCCCTGAGC TCCCTCATCC 
21951 CAGCAGCCTC GCACACTCCC TAGGCTTCTA CCCTCCCTCC TGATGTCCCT 
22001 GGAACAGGAA CTCGCCTGAC CCTGCTGCCA CCTCCTGTGC ACTTTGAGCA 
22051 ATGCCCCCTG GGATCACCCC AGCCACAAGC CCTTCGAGGG CCCTATACCA 
22101 TGGCCCACCT TGGAGCAGAG AGCCAAGCAT CTTCCCTGGG AAGTCTTTCT 
22151 GGCCAAGTCT GGCCAGCCTG GCCCTGCAGG TCTCCCATGA AGGCCACCCC 
22201 ATGGTCTGAT GGGCATGAAG CATCTCAGAC TCCTTGGCAA AAAACGGAGT 
22251 CCGCAGGCCG CAGGTGTTGT GAAGACCACT CGTTCTGTGG TTGGGGTCCT 
22301 GCAAGAAGGC CTCCTCAGCC CGGGGGCTAT GGCCCTGACC CCAGCTCTCC 
22351 ACTCTGCTGT TAGAGTGGCA GCTCCGAGCT GGTTGTGGCA CAGTAGCTGG 
22401 GGAGACCTCA GCAGGGCTGC TCAGTGCCTG CCTCTGACAA AATTAAAGCA 
22451 TTGATGGCCT GTGGACCTGC TACAGTGGCC TGGTGCCTCA TACTCCTCAG 
22501 GTGCAGGGGC AGGGACAAGA GAAGGGGGAA GTAACCCCAT CAGGGAGGAG 
22551 TGGAGGGTGC CTGAGCCGCC ATGTGGGCAT TGGGGGAGTG ATGGGAATGC 
22 601 CAGCAGTGAT GACGTTGACT ACTGACTGAG CACCCACTAC TATGACTGAG 

22 651 CACTCACTCG CTAGATACTA TCTTGAACTG CTCTGTGAGG TTGTTGATAT 
22701 TTTCATTTTT ATCTGTGCTT TACAAATCAG GAAACTGGGA GGCCGGGCGT 
22751 GGTGGCTCAC GCCTGTAATC CCAGCACTTT AGGAGGCCAA GGCAGGTGGA 
22801 TCACAAGGTC AGGAGTTTGA GATCAGCCTG GCCAACATGG TGAAACTCCA 
22851 TCTTTACTAA AAATACAAAA AATTAGCCAG GCATGGTGTT GCATGCCTGC 
22901 ATGCCTGTAA TCCCAGTTAC TTGGGAAGCT GAGGCAGGAG AATTGCTTGA 
22951 ACCCTGGAGG CGGAGGTTGT AGTGAGCCGA GATCACGCCA TTGCACTCCA 
23001 GCTTGGGCAA GAAGAGAAAC ACTCTCAAAA AAAAAAAAAA ATCAGGAAAC 
23051 TGGTGCTCAA AAAGGAAAAG TGACTCACCA AGGTCACAGA CTAGGCAGTG 
23101 ATGCTGGGGG AACCTGGCTC AGGGGACACA GACCTGGCCT GGGGCAGCCT 
23151 TGCAGCTCCT CCACTAAAAT ACTGAAAATG AGGGGCTTCG ATGATGGTTA 
23201 TAATCGTATG GCAGAGCCCC AACTCAACTG GAGCCCTGGG ACCCAGAAGC 
23251 TAGGGTCTCA CTCCCTGCTT TTCCACAAGG CACCATTAGG GCATCACCCC 
23301 AGGCCTCGGC AGCCACGACG CAGGGATCCT GCCTCTCATT GGTTGGGGGC 
23351 TTAGGGGCTC TGGGCTGCCC TCTTGAAGAG GGGGTTCAGC CCAGCGAGGC 
23401 ACCCCCTATG CTGCACCCCA CCAAGGTTAG GAAGAGGTCC TGTCCTCAGT 
23451 GGGGCCCTCT GATGAACAGC CCATCAGGTC TGCGTCCACA TGCCTTGGAA 
23501 GAGATGGTGA CATACTCAAA GTCCTTGAAG CCGCATATTA AACCACCTAG 
23551 AGCACCATCT TCAAACATTT AGGGTCTGAG AAGATAGGGG AAGTAAGCAA 
23601 TTTAAAACAT TTCTTTATAT TGGGCCAGGT GCAATGGCTC ACGTCTGTAA 

23 651 TCCCAGCGCT TTGGGAGGAC GAGGATCACC TGAGGTCAGG AGTTCAAGAT 
23701 CAGCCTGGCC AACATGGAGA AACCCCATCT CTACTAAAAA TACAAAAATT 
23751 AGCTCAGGCG TGGTGATGTG CACCTGTAAT CCTAGCTATT CAGGAGGCTG 
23801 AGGCACAAGA ATTGCTTGAG TCAATATTGC ACCACTGCAC TCCAGCCTGG 
23851 GCAACAGCGA GACTCTTGTC TCAAAAAAAA AAAAAGATAT TTGCTGAAAA 
23901 GACCCAGCCT GCCAAACTCA GGGGCAGCCA AGGGAGGTAG TGAAATGGAA 
23951 GTTGGAGCTC AGCGCTCCCA CACCTCCACT GCCCTCAGGC CTTCTCTGCC 
24001 TCTTTCCCAT CAGTCAGCTG CTTCTGGGCA TGGTCCTGGC AGAGACTTGG 
24051 CCTCCTTCCA GTTCAAGCTC CCTCTTAGAT TGTGTCCCAC GCCACTGAGT 
24101 CTTTGGGACA CTGGGTCAGA TGTCTAGTCT GGCACAATTG GCAGGAATCC 
24151 CAAGAAACAG TGTGAGTGAG GGGACAGTCG TGTTGAGTGC CCTCCATCTG 
24201 GGACTGGGAG GCAGGTCTAT GTCAGGCCTG CATTTAGATC TCTAATGGCT 
24251 CCAGACAAGC CCCTTCAGCT CACTAAGCCT GTTTCCTAAC ACAGCTGTGG 
24301 GATGGTGCTT TGGTTTACAT AGCACGCGAT ACCATCATAG ATCACATGGG 
24351 GAAACTGAGG CCCCAGGAGT GATCTGCTGG CACATGCAGT GACAAGAGGA 
24401 GAGGCCCATC TCAGCCTTGC AGCAAGGTTG CCAGAAATCG ATTCTCGCCC 
24451 CCATCCCGTA AAGATAGCTG GGATTACAGG TGTGCACCAC CATGCCCAGC 

24 501 CTAATTTTTG TATTATTAGT AGAGATGGGG TTTCACCATG TTGTCCAGGC 
24551 TGGTCATGAA CTCCTGACCT CAAGTGATCC ACCCGCTTTG GCCTCCCAAA 
24601 GTGCTGGGAT TACAAGCATG AGCCACAGTG CCTGGCCTGA CCCTGCTCTT 
24651 TTGAAAGACC ATTCCCCCAA ATTCTGTGCA CCTGTGTGCC TTTCTTCTCT 
24701 CTGCCTCCTC TCAGCTCTGC CCCGCTCTCC TCCCTTCTCC TCTGGCAAAT 
24751 CCCACTCATC TCTTGAAGCC CTTCTTCCAG GGGAAGCCCT GATCATGCTG 
24 801 CTTTCTCCTG TGGGAGGGAT GAAGGACGTG GCCCACGGAG TTTGTTTTGT 
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24851 TTTGTTTTGA GATGGAGTTT TGCTCATGTT GCCCAGGCTG GGGTACAATG 
2 4 901 GTACGATCTC AGCTCACTGC AACCTCTACG TCCCGGGTTC AAGCGGTTCT 
24 951 CCTGCCTTAG CCTCCCCAGT AGCTGGGATT ACTGGCATGA ACCACCACAC 
25001 CTGGCTAATT TTGTGTTTTT AGTAGAGATG GGGTTTCTTC ATGTTGGTCA 
25051 GGCTGGTCTC GAACTCCCAA CCTCAGGTGA TCTGCCTGCC TCGGCCTCCC 
25101 AAAGTACTGG GATTACAGGG TTGAGCCACT GTGCCTGGCC CAGGCCCACG 
25151 GAGTTTTAAG AGGCTTCCTG TGGCAGTGGC ATCCAGACGG AGTGCAGAAA 
25201 CTCAAAGTTG AAGGCCAGAA GCTCAGGGAA GGGGGAGTGT GAGTTGAGGA 
25251 GTCTCTTGGC TGCCAGGGCC AGAAACCGAA CTCCAAGCCT CTCCACAACA 
25301 GCGGGTGTAG AGCATGTAGA AT C AG AG AG G AGGCTGAGCC ATGCAGCCCC 
25351 GAGAAGAGGG GAATGCCACT GAGCCACAGA GACCCAGTGC CACTGCCAGG 
25401 TGTCTCTGCC TCCACTTCCC ATGACCCGGC CTGTCTCTGT ATGCAGGCTT 
25451 CACCCTCTCT CGTTGTACAT TGTACACATT CTAGGTGACA CCAGCAGCTT 
25501 CTGATTCTCA TCTCCCATAA CATCAGCCCC CCAGAGAGGG GACAACTGCT 
25551 GAGCTGATAA CAT A AT AG AT GCCCCTTTCC TGGAGGCCAT GGTCATGGTC 
25601 AGCGTGGAGA GGATGAAGCC TGAGCAGGCA GGATCGGGGG TCTAGAGGGG 
25651 AAGGAGGTGG AAGTTGAGAT CACAGACCTG TGGTCAGGTG GCCTGGGAAG 
25701 GGTTTGACGA GTGTCGGCCC AAAGAGCTTG GAAGGGATTT TGCTGCTGTG 
25751 GGTGAGCACT GCCTCTCCCC TTAGGGACAA CAGCCACCTC TTCTCTCCCC 
25801 ATTTGCCTTT CCCTTCTGTA GATATGAAAC ACAGGCCTCC TTGTCAGGCC 
25851 CCTACTTAAC CTCCGTGATG GGGAAAGCGG CCGGAGAAAG GGAGTTCTGG 
25901 CAGCTTCTCC GAGACCCCAA CACCCCACTG TTGCAAGGTG AGTCATGGCC 
25951 TGACACTCTG GATGTGTCCC CTACCCCAAG CTTACTCAGC CAAGAGGCTT 
2 6001 CATCAACTCA CCCCAGCTTT CCCTAGCACC CTCCTGGGCC ACACCTTCAC 
26051 AAAATCACTG ATGCTCAAAG TTGGATATAA TATATTGAAC TGAAGCCTTA 
26101 GCATTTTTAT GCAAGTTACT GTGGAAATTC TAGGAAACCA G AC AG ATT AC 
26151 AAAAAAAAAA AAAAACTAGA AGAAAATTAA CATCACCTAG GAT AT ACT AC 
r'l 2 6201 CTAGGAATAA CGTCTTTTAT TTTGAGATGG AGTTTCGCTC TTGTTGCCCA 

2 6251 GGCTGGAGTG CAGCGGTATG ATCTCGGCTC GCTGCAACCT CCGCCTCCTG 
%y 26301 GGTTCATGTG ATTCTTCCAC CTCGGCCTTC CTAGAGCCCA AGTGGTCTGC 

£0 26351 CTGCCTCTGC CTCCCAAAGT TCTGGGATTA CAGGCATGAG CCACCGCACC 

~F 26401 CAGCCAAAAT TACTTAACTT TTCTTCTAGA TACTTTTTAA AAATATGGCA 

- Bp 2 6451 GTAAGTTTTT CATAAAAAAT GGAGCCATGC TATCCAGTGG AAATTTAATG 

H 2 6501 TTGCCCACAT GTATAACTTA AAAATTTCAT ATATGTGTAT ACATATATAT 

M* 2 6551 GAAATATATA TATACAGACA CACATATATA TGTATACATA TATATACACA 

Ik 26601 TATATATGTA TACATATATA C AC AC AT AT A TGTATACATA TATATACACA 

Ul 2 6651 CATATACACA TATATACACA CACATACATA TATACACACA CAT AT AT AC A 

Kl 2 6701 CACATATATA CACACATGCA CACATATATA TGTATACATA TATACACACA 

2 6751 TGTATACGTA TATATACACA CAT AT AT AC A CACATATATA T AC AC AC AT A 
L 2 6801 TACACACATA CACACACATA TATACACACA TATATACACA CATATATACA 

^ 26851 CACATATATA TGTATACATA TATATACACA CATATATACA CATACACACA 

26901 TACATATATA CACATATACA CATATACACA CACATATACA CACATGTATA 
fs; 26951 CATATATATA CACACATGTA TACATATGTA TACACACACA TAT AT GT AT A 

IZ. 27001 CATATATACA CACATACATA TGTGTACATA TATACACACA TACATATGTA 

Ul 27051 TACATATATA CACACAT 

pi (SEQ ID NO:5) 
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Isoform 1: 



FEATURES : 

Exon: 2031-2094 

Intron: 2095-5569 

Exon: 5570-5762 

Intron: 5763-7571 

Exon: 7572-7741 

Intron: 7742-10000 

Exon: 10001-10173 

Intron: 10174-10298 

Exon: 10299-10485 

Intron: 10486-12027 

Exon: 12028-12193 

Intron: 12194-25821 

Exon: 25822-25939 



Allelic Variants (SNPs) : 

DNA 



Position 


Major 


Minor 


921 


C 


T 


1781 


C 


T 


1850 


G 


A 


2839 


A 


G 


3730 


G 


A 


6631 


G 


A 


6945 


- 


A 


6952 


A 


T 


7457 


G 


A 


7830 


T 


A 


8089 


T 


C 


8551 


C 


T 


9269 


G 


C 


9362 


C 


T 


9782 


G 


T 


11493 


G 


A T 


12260 


A 


G 


13086 


T 


C 


13183 


T 


C 


21240 


C 


G 


21695 


A 


G 


22058 


C 


T 


22233 


C 


A 


22245 


C 




22375 


C 


T 


23042 


A 


- T 


23344 


T 


C 


23873 


A 




24764 


G 


T 


24939 


T 


C 


24945 


G 


A 


25092 


C 


T 


25428 


T 


G 


25513 


C 


T 


25684 


C 


T 


26165 


A 





Domain 

Beyond ORF (5') 

Beyond ORF(5 1 ) 

Beyond ORF ( 5 ' ) 

Intron 

Intron 

Intron 

Intron 

Intron 

Intron 

Intron 

Intron 

Intron 

Intron 

Intron 

Intron 

Intron 

Intron 

Intron 

Intron 

Intron 

Intron 

Intron 

Intron 

Intron 

Intron 

Intron 

Intron 

Intron 

Intron 

Intron 

Intron 

Intron 

Intron 

Intron 

Intron 

Beyond ORF (3' ) 



Protein 

Position Major Minor 



Context : 
DNA 

Position 

921 TTGGAGATATTTTAAGGTCATAGTGTCTTCACAAATTGAGCTGAAAGGGAACTGTTAGGA 
TGATCTTGCCTAACCCTCTCATCTCACACAGGAAGAACTATTTTAAACTCGAGAGGTTAA 
GTGACCTGGCCAAAGTCACACAGCCACCACTAGTTAACTCGTATACATTGATTCTCCTGT 
GGGGCTGGGCAGATGAGGAATCTTTTGTTCTCTTCCCTGTTTGCAGAGATTTTTTTTGAG 
GTTACTTTCCGAGTTCTGGCAAGTACCCCTGCTTCTGGTAGCTTTGTGTCTCGATTCAAT 
[C,T] 

TCATTCTTTTTATTTTATTTTATTTTTGAGACAGGGTCTCACTTTGTCACCCAAGCTGGA 
GTGCAGTGGTGTAATCTTGGCTCACTGTAGCCTCCACCTCTTGGGTTCAAGCGATCCTCC 
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TGCCTCAGCCCCCCAAGTAGCTGGGATTACAGACGTCTGCCACCACGCCAGGCTAATTTA 
TGGTTTTTTGTATGTGTTTTTTGTGTTTTTGTAGAGACAGTGTTTCCCCATGTTGCCCAG 
GCTGGTCTCCAACTCCTGAGCTCAAGTGATCTGCCCGCCTCAGCCTTTCAAAGTGCTAGG 

17 81 ACAGCTGACTCCAGCAATGCTGCTC ACGTGACCACTGCAGCTGCAGCTCCCGTTCC ACTC 

CTTGTCCTGGGCTAGGTGGGCACTACCAGGGGCTCCTTTGGTAAGGAGTACCGGGTAGGC 
ACCCGGTCCTGCCAATCCACCACTGGAACAGCTGGGGGGACAGCAGACAGGCACGGTCGG 
ACAGACTTGACAGATCAGGCATCAGGCCCTCTGCGCTGGTCCCGGGCTCTTTAAGCAGGA 
ACGTGAATGGCCTCAAGATGTCTCACATGGTCCCACTAGCCCTCCTCCTCCCTTTGTTCC 
[C,T] 

TACCTCCAGGAGGGCTGCTCTGCCCTTCCTTCCTCTGTTCTTTGGCCTTATGTTCCCCGC 
CACCACAGGCCTTCCCCCGCCCCACCCCTCTGCAGACTTAGCCGTGCATTGCAGGCATGG 
AGGATTAATCAGTGACAGGAAGCTGCGTCTCTCGGAGCGGTGACCAGCTGTGGTCAGGAG 
AGCCTCAGCAGGGCCAGCCCCAGGAGTCTTTCCCGATTCTTGCTCACTGCTCACCCACCT 
GCTGCTGCCATGAGGCACCTTGGGGCCTTCCTCTTCCTTCTGGGGGTCCTGGGGGCCCTC 

1850 GGCTAGGTGGGCACTACCAGGGGCTCCTTTGGTAAGGAGTACCGGGT AGGCACCCGGTCC 

TGCCAATCCACCACTGGAACAGCTGGGGGGACAGCAGACAGGCACGGTCGGACAGACTTG 

ACAGATCAGGCATCAGGCCCTCTGCGCTGGTCCCGGGCTCTTTAAGCAGGAACGTGAATG 

GCCTCAAGATGTCTCACATGGTCCCACTAGCCCTCCTCCTCCCTTTGTTCCCTACCTCCA 

GGAGGGCTGCTCTGCCCTTCCTTCCTCTGTTCTTTGGCCTTATGTTCCCCGCCACCACAG 
f G, A] 

CCTTCCCCCGCCCCACCCCTCTGCAGACTTAGCCGTGCATTGCAGGCATGGAGGATTAAT 
CAGTGACAGGAAGCTGCGTCTCTCGGAGCGGTGACCAGCTGTGGTCAGGAGAGCCTCAGC 
AGGGCCAGCCCCAGGAGTCTTTCCCGATTCTTGCTCACTGCTCACCCACCTGCTGCTGCC 
ATGAGGCACCTTGGGGCCTTCCTCTTCCTTCTGGGGGTCCTGGGGGCCCTCACTGAGATG 
TGTGGTGAGTAACTCGCCTCTATCCTGTGCCTCTTTCCTCCTGGGTCCTTAGTGGGGTGG 

2839 AA C AT AGG G AG AC CC CAT C T C T AC AA AAAA T AAAAAAAAT T AAAAA AT AGC TGGGCATGG 

GGAAGACTTTCTGAAGACCAAGAGGACACATGGGAGCTGAAACTCGAAGGAAGAAAAGGA 

GCTGGCAGGAAAGGAGTGGGGGACACACATTCTAGGCAGCAGGAAGTGAGCCTTCGGAGG 

TCCTGCCTGCTCCAGCTCTGTGCCCCAAGGGGTCTCTTGGAGCACAGTCTCCTGGGACCT 

GTCTATGAGTCTGAGCTTAGAGGCTCAGGGCTGCTCCTTCAGACAGGAGGCAGAAGGCAG 
[A,G] 

CTTTGGGAACTTTGGGCCGCCCACGCGCCTTTTCTCCTCCTCTGCACCTAGGATTACGTT 
GAGCAATACACTTTCACCCCCATGGTCTCTTGAGACCCTGGGGAAACCCTGAGAGGTGGG 
TGCAGTCATGTCCAGGTGTCAAGTGAAGAAGTCGAGGGTTGGAGGGGCTGAGTGACCCAC 
TCAGGGTGCTCCACCTTTTCCAGAGCTTTGCTGAACTTAGTTTTTAGAACTTGAAGCCTC 
GTTTGTTTTCGTTTTGTTTTTTGTTGAGAGAGGTTCTCCCTCTGTTGCCCAGGCTGGAGT 

3730 GACACCTCAGGTCTGGGCCCAGGAACCCCAGCTCTTGGTTCATGTCCGGACAGTCCCCAG 

GGGAGTTCTGGGTTCAACCAGCAAGAGCTCTTCCTCCTGGCTGATCTGGTCCTCAGCCTT 

GGACAGTTAGTCCATTAACCTGACCCCACAGGAGCCCCAATCCCTTGGGGTCTGGGGAAT 

CTTGAACTGGGGTTTGGGGTGCAAATATCTGCACTGAGTCACTTAATTGCACCCAGCCTC 

ATTCCTTTATCTGTAAAGTGGGCTAAGAATGCTCCCCTGCCTTCCTCCTCGGTGTAGTAC 
[G, A] 

AGGAAGGATCCCATGACACCTGCTCTCCCAGTTTAAAGCTCTATATGTATGTTGTGAAAT 
TGACAGGGATCGCTGCACAAACGCTAATGCAAAGTGGGCTCCTGTGCTTCCTTTTCTCTT 
TCTTCTTCTTTTTTTTTTTTTTAATTTTCTTCTAGAGATGAGGTCTCACTATATTGCCCA 
GGGTTGGTTTCAAACTCCTAGGGTCAAGCGATCCTCCCACCTTGGCCTCCCAAACTGCTG 
GTATTACAGGCGTGAGCCACTCTGTCTGGCTCCTATGCTTGTGAATGTCAACAGCAATCA 

6631 TGAATAGCTGGGATTACAGGCGTGTGCCACCATGCCCAGCTGATTTTTGTATTTTTGGTA 
GAGATAGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCCG 
CCTGTCTTGGCTTCCCAAAGTGTTGGGATTATAGGCATGAGCCACTGCACCAATCCAAAA 
GCAGCATCTTTGTGCTCCCTTTTCAAGAGGCATCACAGAGAGGCCTGTTTTGGGGTTTGA 

ATGAGAGGCGAAGAATCAGCCATGGAGTGCCTCTTTCTCAGACTCCCTCTTGAGAAGTGG 
[G,A] 

TGCAGGGGTGGAGAGAAAAGAAGACTAGGCATAGTGGCTCATACCTGTAATCCCAACATT 
TTGGGAGGCTGAGGCAGGAAGATTGCTTGAGCTCAGGAGTTTGAGACCAGCCTAGGCAAC 
ATAGTGAGACCACATCTCTTAAAAAAAAGAAAAAGAAAAAAAATGAGCCAGGTGTAGTGA 
CTCATGCCTGTGGTCCCCACTTCTCCGGAGGCAAAGGTGGGAGGATCTTTTGAGGCTGAG 
AAATCGAGGCTACAGTGAGCCATGGTGGCACCACTGCACTCCAGCCTGGGAGACAGAGAG 

694 5 AGAAAAGAAGACTAGGCATAGTGGCTCATACCTGTAATCCCAACATTTTGGGAGGCTGAG 

GCAGGAAGATTGCTTGAGCTCAGGAGTTTGAGACCAGCCTAGGCAACATAGTGAGACCAC 

ATCTCTTAAAAAAAAGAAAAAGAAAAAAAATGAGCCAGGTGTAGTGACTCATGCCTGTGG 

TCCCCACTTCTCCGGAGGCAAAGGTGGGAGGATCTTTTGAGGCTGAGAAATCGAGGCTAC 

AGTGAGCCATGGTGGCACCACTGCACTCCAGCCTGGGAGACAGAGAGACCCTATCTCAGT 
E-,A] 

AAAAAAAAAAATAAAAATATGGCTGGGTGTGGTGGCTCACGCCTGTAATCCCAGCACTTT 
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GGGAGGCCAAGGTAGGTAGATCACATGAGGTTAGGAGTTCGAAACCAGTCTGGCCAACAT 
AGTGAAACCCTGTCTCTACTGAAAATACAAAAAATTAGCCAAGGGTGGTGGTGGGCAACT 
GTAATCCCAGCTACTTGGGAGGCCGAGGCAGAAGAATCGCTTGAACTCGGGAGGCGGAGG 
TTGCAGTGAGCTGAGAACATGC 

6952 AAGACT AGGCAT AGTGGCTCATACCTGTAATCCCAACATTTTGGGAGGCTGAGGCAGGAA 

GATTGCTTGAGCTCAGGAGTTTGAGACCAGCCTAGGCAACATAGTGAGACCACATCTCTT 
AAAAAAAAGAAAAAGAAAAAAAATGAGCCAGGTGTAGTGACTCATGCCTGTGGTCCCCAC 
TTCTCCGGAGGCAAAGGTGGGAGGATCTTTTGAGGCTGAGAAATCGAGGCTACAGTGAGC 
CATGGTGGCACCACTGCACTCCAGCCTGGGAGACAGAGAGACCCTATCTCAGTAAAAAAA 
[A,T] 

AAAATAAAAATATGGCTGGGTGTGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGC 
CAAGGTAGGTAGATCACATGAGGTTAGGAGTTCGAAACCAGTCTGGCCAACATAGTGAAA 
CCCTGTCTCTACTGAAAATACAAAAAATTAGCCAAGGGTGGTGGTGGGCAACTGTAATCC 
CAGCTACTTGGGAGGCCGAGGCAGAAGAATCGCTTGAACTCGGGAGGCGGAGGTTGCAGT 
GAGCTGAGAACATGCCACTGCACTCCAGCCTGGGCAACAAGAGCGAAACTCTGTCTCAAA 

7457 AAGAATCGCTTGAACTCGGGAGGCGGAGGTTGCAGTGAGCTGAGAACATGCCACTGCACT 
CCAGCCTGGGCAACAAGAGCGAAACTCTGTCTCAAAGAAAATAAATAAATAAAATAAAAA 
AATAAAAAAGGAGGGGGCATATGGGTGAAGTATGGACAAAATAGTGGGGCAGGCACAGAT 
GATCTGGACACAGGAGCCCTTGGAGTTTATTCTTGAATCTAACTGTTCATCTTTATTAAA 

TATTTGTGGCATACACCTCACAACAACATAGCCAACACACCTCCTTTTGGAGCTTTTATC 
[G, A] 

AAGTTTCCCACTGTTAAGATTTTTTCCCGCTTTGTGATGCGGGTGGGGTGGGTGCTGTAA 
GCAGGCTTACGGGGTGGCAGTTTCTCACAAAGGCATTAACTGGCCTTGTCCTAGGTCTGC 
CTTCAGCGAGGATGACGGTGACTGCCAGGGCAAGCCTTCCATGGGCCAGCTGGCCCTCTA 
CCTGCTCGCTCTCAGAGCCAACTGTGAGTTTGTCAGGGGCCACAAGGGGGACAGGCTGGT 
CTCACAGCT C A AATGG T T C C T GG AGGAT G AG AAG AGA GC CAT T GGT G AGC AG AC AC C AT C 

7830 GGTGGCAGTTTCTCACAAAGGCATTAACTGGCCTTGTCCTAGGTCTGCCTTCAGCGAGGA 

TGACGGTGACTGCCAGGGCAAGCCTTCCATGGGCCAGCTGGCCCTCTACCTGCTCGCTCT 

CAGAGCCAACTGTGAGTTTGTCAGGGGCCACAAGGGGGACAGGCTGGTCTCACAGCTCAA 

ATGGTTCCTGGAGGATGAGAAGAGAGCCATTGGTGAGCAGACACCATCCGCTGGGGGTGG 

GGAGCAGCTGGGAGGGCTCATCAGATGATATTCTCCAATGAGAATCAGAACTTTGGGTTT 
[T,A] 

CTCCCCAGGCGTCTTTCCCACCATCCATTCTGCCCATCTCACTGCCTACGTAGAGGCTCG 
AACCTGTCCCCATAGCCATCCTTGACCCAGCTTTTCCCGCGCTGCACACATACTATTGAC 
AGGTGTGTTTCGTGGTTTTTTGTTTTTTGTTTGTTTGTTTGTTTTGAGTTGGAGGTTTGC 
TCTTGCTGCCCAGGCTGGAGTACAATGGCGCAATCTCAGCTCACCGCAATCTCTGCCTCC 
TGGGTTCAAGCAATTCTCTTGCCTCAGCCTCCTGAGTAGCTGGGATTACAGGCATGCGCC 

808 9 ATCAGATGATATTCTCCAATGAGAATCAGAACTTTGGGTTTTCTCCCCAGGCGTCTTTCC 
CACCATCCATTCTGCCCATCTCACTGCCTACGTAGAGGCTCGAACCTGTCCCCATAGCCA 
TCCTTGACCCAGCTTTTCCCGCGCTGCACACATACTATTGACAGGTGTGTTTCGTGGTTT 
TTTGTTTTTTGTTTGTTTGTTTGTTTTGAGTTGGAGGTTTGCTCTTGCTGCCCAGGCTGG 
AGTACAATGGCGCAATCTCAGCTCACCGCAATCTCTGCCTCCTGGGTTCAAGCAATTCTC 
[T,C] 

TGCCTCAGCCTCCTGAGTAGCTGGGATTACAGGCATGCGCCACCACACCCAGCTAATTTT 
GTATTTTTAGTAGACGTGGGGTTTCTCCATGTTGGTCAGGCTGGTCTCGAACTCCTGACC 
TCAGGTGATCCGCTTGCCTTAGCCTCCGAAAGTGCTGGGATTACAGGCATGAGCCACTGC 
GTTAGGCCCACTGACAAGCCTTGTATTGGCTAGCCACCAAGATTGACTTGATTATCCACC 
TTCGGGACAACTGGACAGCCTGCTTATGACTTACGCCATAGTCTGTCTCTACTAGCTCTC 

8551 TACAGGCATGAGCCACTGCGTTAGGCCCACTGACAAGCCTTGTATTGGCTAGCCACCAAG 
ATTGACTTGATTATCCACCTTCGGGACAACTGGACAGCCTGCTTATGACTTACGCCATAG 
TCTGTCTCTACTAGCTCTCCTGCCCTGACTTGACCCAGCATACAACAGCCAGAGCCAGCC 
TTTTCAATATAAACCTGATCTTGCTGGCACTGCTTAAACCCTGCAGGGGCCTCGCACTGC 
TCCATGGCCCAGCCTGTCTACCCTTACCTTCTGCCCAGGCTCTGCTCATCCATTCTCTGC 
[C,T] 

TCCCACACACCTGCCCTCTGTGGGCTCCAGCCATACCATCTCTCAACTCATAAGCCAGTT 
TTTTCATACAGGCTCCCTCCATCTGGACTGGCTTCCCTGCGTGCAGTTCACTCCTGCTCT 
ACCTTTGGCTCTGCCTCCACCCATCCTCAGCCGTCTCCAGCATTACCTCCTTGGAGAATC 
CTGCCTTGACTTCCCAGCCACCCAAATATCACTACTTGGTCTGCATTCTCGTTGCAATTG 
CAGTCGCATGAGCAATTGCTGTGGTTGAGGCCCGAACTGCGCAAGTGCCTGTCTGCCATG 

9269 AGGCCAGGGTCCCAGGTGCTGGCGGGGCTGGCTGCTGGGTGGGGGCAGAGAGGCAACCCC 
TCTGTTTTTTTCCCTCTCAGGGCATGATCACAAGGGCCACCCCCACACTAGCTACTACCA 
GTATGGCCTGGGCATTCTGGCCCTGTGTCTCCACCAGAAGCGGGTCCATGACAGCGTGGT 
GGACAAACTTCTGTATGCTGTGGAACCTTTCCACCAGGGCCACCATTCTGTGGGTGAGTA 
GGTCAGACCGTGCCAAGGCCAGGCTGGCACTCCCTCAGTCCCCAGGTCTGCACTGATGAC 
[G,C] 
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TCCATACCCTGGCCCCCACACTCACCTTTCCTTGGGGCTCCTCCGAATCAAGTCCTTTAG 
GGACGAATTGGCGAGGGCTCATGGGTGATGCTCCAGCTGTGAGCCAGCTTTGGAGCTGGT 
AGGTGGATCTCTTGAGGCCAGGAGTTCAAGACAACGTGGTGAAACCCCATCTCTACTAAA 
AATAAAAAAGTTAGCCGGGCATGGTGGCACATGCCTGTAGTCCCAGCTACTCGGGAGGCT 
GAGGCAGGAGAATCACTTGAACCTGGGAGGCGGAGGCTGCAGTGAGTGGAGATCGCACCA 

93 62 GGGCCACCCCCACACTAGCTACTACCAGTATGGCCTGGGCATTCTGGCCCTGTGTCTCCA 
CCAGAAGCGGGTCCATGACAGCGTGGTGGACAAACTTCTGTATGCTGTGGAACCTTTCCA 
CCAGGGCCACCATTCTGTGGGTGAGTAGGTCAGACCGTGCCAAGGCCAGGCTGGCACTCC 
CTCAGTCCCCAGGTCTGCACTGATGACGTCCATACCCTGGCCCCCACACTCACCTTTCCT 
TGGGGCTCCTCCGAATCAAGTCCTTTAGGGACGAATTGGCGAGGGCTCATGGGTGATGCT 
[C,T] 

CAGCTGTGAGCCAGCTTTGGAGCTGGTAGGTGGATCTCTTGAGGCCAGGAGTTCAAGACA 
ACGTGGTGAAACCCCATCTCTACTAAAAATAAAAAAGTTAGCCGGGCATGGTGGCACATG 
CCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCACTTGAACCTGGGAGGCGG 
AGGCTGCAGTGAGTGGAGATCGCACCACTGCCCTCCAGCCTGGGCAACAGAGTGAGTGAG 
ACTCTGTCTCAAAAAATAAAAAATAAAATAAAACTCCCCTAGTGATTCCAATGTGCAGCT 

9782 GCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCACTTGAACCTGGGAGGCG 
GAGGCTGCAGTGAGTGGAGATCGCACCACTGCCCTCCAGCCTGGGCAACAGAGTGAGTGA 
G AC T C T GT C T C A AAAAAT AA A AAAT AA AAT AA AAC TC C C C T AGT G ATT C C A AT GT G C AG C 
TAAGTTTGGAAATAGGTGGTATGGGGTCAAGTCCTCTTGGGCCTCCCTCCTCCAGTCCTT 
CTCCCTAACCTCTAGCCCTCAAGTTGCAGAGTGATCAGCCAAACCAGTTTGCCCAGAAAT 
EG, T] 

AGCAGTTTCCTGGGACACAGGATTTTCAGAGTCCAGACAAGGAAAGTCTTGGGCAGACCA 
GGTTGAGTTGGTGCCCTTAGCTGATCTGACCATGTTGCCCTTCTTCTCCAAGCCCTCCTG 
TGGTTGTCCATAGCTACAAGGGCCTGACCCTCAAGCCCCTGCCTGTCCTGGCCCCTTTGG 
CTCTCCAGCTCATTGCATGTTCTGTCCCCCACTTCAAGACACAGCAGCCATGGCAGGCTT 
GGCATTCACCTGTCTGAAGCGCTCAAACTTCAACCCTGGTCGGAGACAACGGATCACCAT 

11493 AAAAAAAAAAATGGAGAAGAAGGAAGCTGGACATGGTGGCTCGTGCTTATAATCCTAGCA 
CTCTGGGAAGCTGAGGCAGATGGATTGCCTGAGCCCAGGAGTTTGAGACCAGCCTGGGCA 
ACATGGTGAAACCCTGTCTTTACTAAAATACGAAAGATTAGCCAGGCATGGTGGTAGACA 
CCTATAATCCCAGCTACTAGGGAGGCTGAGCCACAAGAATCACTTGAACCTGGGAGACAG 
AGGTTGCAGTGAGCCGAGATCGCGCCATTGCACTCCAGCCTGGGCGACAGTGTGAGACTC 
[G,A,T] 

GTCTCCAGAAAAAACAAGAATGGATAGAGTGGAGCCAAGAAGAGGCAGGAAGAACAAAGA 
CACAGAGGTGCACAGAGTTTGGGGGAATTTTGAGGAATGGTCTTGCAAAAGAGTGGGATC 
TGGGAGAATGAGTGGGAGTGGAAAGCAGATGAATGAAGAGAAGGTGAGCGCATCAGGGTA 
ACAGAGATGCGTTGTGAACAAATGCATGTTCTAGGAAGAGCCCTCTGGAGTGCTAGGTGC 
CAGAGAGGTGGGAGGAAGGATACTGGAAGCAGAGAAACCAGTGAGGGGCCTGATCTTGGG 

122 60 ACAAGAAGACAAATAATCCAGGCTCTCTGTCCTCACACCAGCTGCCCGCCCCTTTCTTCC 
TGGCACAGTCATGTTGGAACCAGCTGCTGAGACCATTCCTCAGACCCAAGAGATCATCAG 
TGTCACGCTGCAGGTGCTTAGTCTCTTGCCGCCGTACAGACAGTCCATCTCTGTTCTGGC 
CGGGTCCACCGTGGAAGATGTCCTGAAGAAGGCCCATGAGTTAGGAGGATTCACGTGAGA 
CTCCCACCTCCCAGTCCTCACCCCACCCAACCTCACATGCCTGATAACAGGGTCACAGAA 
[A,G] 

AGACGGGGAACAGAGGAGAGGGTTCCCTCGGGAGAGACACTGGCCCTGCTTCTGCTTCTA 
CCTGCTCAGCTCCTTTCTTGCCCACGGTGTTATGGAAACAGGGAGCCATAGGCCAGCATT 
GTCACTGAGAGAGCAGGCTTTGGAGGCAGAGCCCCCCAGTTGGAATCCCAACTCTAACCA 
GCTAGGTTCCAGGTAGGCACCCACAATTCACCGAGGAGAACAGTTGTGCCCCTTCCCTGC 
AGGGCCAGTGTGAAGAGTCCAGGAGTTAGTACACATAGAGATAGTGGCATGTGCTTTTTA 

1308 6 GGCACGTGCCACACAACCCTGGGTAATTTTTTTTTTTTTTTTTTTGAGATAGGGTCTCTG 
TCTGTTGCCCAGGCTGGTCTCAAATTCCTGGCCTCAAACCATCCTCACACCTGAGGCGCT 
CAAAATATTGGGATTATAGGTGCGAGCCATCATGCTCAGCCAGAATAATAACTGGTTTTT 
TTTGTTTTTTTTTTGAGACAGAGTCTCACTCTATTACCCAGGCTCTGGAGGCCCAACTCG 
TGTTTGTGTATTTGTTTATTTTTATTTATTTATTTATTTCGAGACAGAGCCTCTCTCTTT 
[T,C] 

ACCTAGGCTGGAGTGCAGTGGCGCAATCTCGGCTCACTGCAACCTCCGTCTCCTGGGTTC 
AAGTGATTGTCCTGCCTCAGCCTCCTGAGTAGCTGGTGCTACAGGCGCGTGCCACCATGC 
CCAGCTAATTTTTGTATTTTTAGTAGAGACAGGGTTTTACTATGTTGGCCAGCTGGTTTC 
TAACTCCTGAACTCGGGTGATCTGCCTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGC 
ATGGGCCTCCGTGCCCGGCCATGTATTTATTTAGGCAAGGTCTCTCTCTGTTATCCAGGC 

13183 ACCATCCTCACACCTGAGGCGCTCAAAATATTGGGATTATAGGTGCGAGCCATCATGCTC 
AGCCAGAATAATAACTGGTTTTTTTTGTTTTTTTTTTGAGACAGAGTCTCACTCTATTAC 
CCAGGCTCTGGAGGCCCAACTCGTGTTTGTGTATTTGTTTATTTTTATTTATTTATTTAT 
TTCGAGACAGAGCCTCTCTCTTTCACCTAGGCTGGAGTGCAGTGGCGCAATCTCGGCTCA 
CTGCAACCTCCGTCTCCTGGGTTCAAGTGATTGTCCTGCCTCAGCCTCCTGAGTAGCTGG 
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[T,C] 

GCTACAGGCGCGTGCCACCATGCCCAGCTAATTTTTGTATTTTTAGTAGAGACAGGGTTT 
TACTATGTTGGCCAGCTGGTTTCTAACTCCTGAACTCGGGTGATCTGCCTGCCTCGGCCT 
CCCAAAGTGCTGGGATTACAGGCATGGGCCTCCGTGCCCGGCCATGTATTTATTTAGGCA 
AGGTCTCTCTCTGTTATCCAGGCTGAAGTGCAGTGGCACATTCATAGCTCACTGCAGCCT 
CAAATTATCCAAGTAACAGGGACTACAGGCATGCACCACCACACCCATCTACTTTTTTTT 

2124 0 TCAGCTCACTGCAACCTCTGCCTCCCTGGCTTAAGCGATCCTCCCACCTCAGCCTCCTGA 

GTACGTGTGACCATAGGCCCATGGCACAAAGCCCAGCTAATTTTTTGTATTTTTAGTAGA 

AATGTGGTTTCACCATGTTGCATAGGCTGGTCTCGAACTTCTGAACTCAAGTGATCTGCC 

TGCCTTGGCCTCCCAAAGTGCTGGGATTCTAGGTATGAGCCACCCTGCTCGGCCTATAAT 

GGCACTTTCCTATCCCATTGATGAGGCTCTACTCTCATGACCTAATCATCTCCCAAAGGC 
EC,G] 

CTAAGGCCTCCTGATACCATCACCTTTGGGGTTAGGTTTTAACATATACATTTTGGGGGG 
ACACAGACATTTTAGACCATAGCACCTCCATTGAAAGGAAACATTTCTGACACCTGGCTA 
TCTCAAAGGGCCCTTTCAGTTCCCCTGCAGGCTGCATTCCCACATCACCAACAAGAGCAG 
CGACACTCACTCAGAGGTTAAATAACTTGTCCAGAGTCACAGCAGTAATGAATGACAGAG 
CTGGGGCTTGAATCCAGGCGTCCTCCTAGAGCCTGGATTCTGTGTAGTGAGTGAAAGCTG 

21695 CATTCCCACATCACC AACAAGAGCAGCGACACTCACTCAGAGGTTAAATAACTTGTCCAG 

AGTCACAGCAGTAATGAATGACAGAGCTGGGGCTTGAATCCAGGCGTCCTCCTAGAGCCT 
GGATTCTGTGTAGTGAGTGAAAGCTGACTCCTGGGAGACTTCTGCGTGGTCCTGGTTCTC 
TCTCCAGACTGCACTGCGCAAGTTTCTCTTCCTGATGGTCCCTAGGGTATTACAAAGACA 
GTGGCCCTGCCTGTCAGGTGTTTTTATTACCAGATGAGGTCATGGCCTCAGGAACCCTGT 
[A,G] 

GGAAGCTGAGTTCAGAGTCTTTGAGCAGGCTTTAGGGAGGTTCCAGCTTCCCACCACCAA 
GCCCCAGGTGGATTCTTACAGACTCTAGCCTCAGGGTGGGGGGTCTGGAAGATGAGGTTG 
CGGGGTGCGATATTCTGCCCAATTCGCCCCTCCTTGCTCAATCTGTTTCTGCAGGTATTG 
CTGACTACAGACCCAAGGATGGAGAAACCATTGAGCTGAGGCTGGTTAGCTGGTAGCCCC 
TGAGCTCCCTCATCCCAGCAGCCTCGCACACTCCCTAGGCTTCTACCCTCCCTCCTGATG 

22058 CCCAGGTGGATTCTTACAGACTCTAGCCTCAGGGTGGGGGGTCTGGAAGATGAGGTTGCG 
GGGTGCGATATTCTGCCCAATTCGCCCCTCCTTGCTCAATCTGTTTCTGCAGGTATTGCT 
GACTACAGACCCAAGGATGGAGAAACCATTGAGCTGAGGCTGGTTAGCTGGTAGCCCCTG 
AGCTCCCTCATCCCAGCAGCCTCGCACACTCCCTAGGCTTCTACCCTCCCTCCTGATGTC 
CCTGGAACAGGAACTCGCCTGACCCTGCTGCCACCTCCTGTGCACTTTGAGCAATGCCCC 
[C,T] 

TGGGATCACCCCAGCCACAAGCCCTTCGAGGGCCCTATACCATGGCCCACCTTGGAGCAG 
AGAGCCAAGCATCTTCCCTGGGAAGTCTTTCTGGCCAAGTCTGGCCAGCCTGGCCCTGCA 
GGTCTCCCATGAAGGCCACCCCATGGTCTGATGGGCATGAAGCATCTCAGACTCCTTGGC 
AAAAAACGGAGTCCGCAGGCCGCAGGTGTTGTGAAGACCACTCGTTCTGTGGTTGGGGTC 
CTGCAAGAAGGCCTCCTCAGCCCGGGGGCTATGGCCCTGACCCCAGCTCTCCACTCTGCT 

22233 CCCTGAGCTCCCTCATCCCAGCAGCCTCGCACACTCCCTAGGCTTCTACCCTCCCTCCTG 

ATGTCCCTGGAACAGGAACTCGCCTGACCCTGCTGCCACCTCCTGTGCACTTTGAGCAAT 

GCCCCCTGGGATCACCCCAGCCACAAGCCCTTCGAGGGCCCTATACCATGGCCCACCTTG 

GAGCAGAGAGCCAAGCATCTTCCCTGGGAAGTCTTTCTGGCCAAGTCTGGCCAGCCTGGC 

CCTGCAGGTCTCCCATGAAGGCCACCCCATGGTCTGATGGGCATGAAGCATCTCAGACTC 
[C,A] 

TTGGCAAAAAACGGAGTCCGCAGGCCGCAGGTGTTGTGAAGACCACTCGTTCTGTGGTTG 
GGGTCCTGCAAGAAGGCCTCCTCAGCCCGGGGGCTATGGCCCTGACCCCAGCTCTCCACT 
CTGCTGTTAGAGTGGCAGCTCCGAGCTGGTTGTGGCACAGTAGCTGGGGAGACCTCAGCA 
GGGCTGCTCAGTGCCTGCCTCTGACAAAATTAAAGCATTGATGGCCTGTGGACCTGCTAC 
AGTGGCCTGGTGCCTCATACTCCTCAGGTGCAGGGGCAGGGACAAGAGAAGGGGGAAGTA 

22245 TCATCCCAGCAGCCTCGCACACTCCCTAGGCTTCTACCCTCCCTCCTGATGTCCCTGGAA 

CAGGAACTCGCCTGACCCTGCTGCCACCTCCTGTGCACTTTGAGCAATGCCCCCTGGGAT 

CACCCCAGCCACAAGCCCTTCGAGGGCCCTATACCATGGCCCACCTTGGAGCAGAGAGCC 

AAGCATCTTCCCTGGGAAGTCTTTCTGGCCAAGTCTGGCCAGCCTGGCCCTGCAGGTCTC 

CCATGAAGGCCACCCCATGGTCTGATGGGCATGAAGCATCTCAGACTCCTTGGCAAAAAA 
EC,-] 

GGAGTCCGCAGGCCGCAGGTGTTGTGAAGACCACTCGTTCTGTGGTTGGGGTCCTGCAAG 
AAGGCCTCCTCAGCCCGGGGGCTATGGCCCTGACCCCAGCTCTCCACTCTGCTGTTAGAG 
TGGCAGCTCCGAGCTGGTTGTGGCACAGTAGCTGGGGAGACCTCAGCAGGGCTGCTCAGT 
GCCTGCCTCTGACAAAATTAAAGCATTGATGGCCTGTGGACCTGCTACAGTGGCCTGGTG 
CCTCATACTCCTCAGGTGCAGGGGCAGGGACAAGAGAAGGGGGAAGTAACCCCATCAGGG 

22375 ACAAGCCCTTCGAGGGCCCTATACCATGGCCCACCTTGGAGCAGAGAGCCAAGCATCTTC 
CCTGGGAAGTCTTTCTGGCCAAGTCTGGCCAGCCTGGCCCTGCAGGTCTCCCATGAAGGC 
CACCCCATGGTCTGATGGGCATGAAGCATCTCAGACTCCTTGGCAAAAAACGGAGTCCGC 
AGGCCGCAGGTGTTGTGAAGACCACTCGTTCTGTGGTTGGGGTCCTGCAAGAAGGCCTCC 
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TCAGCCCGGGGGCTATGGCCCTGACCCCAGCTCTCCACTCTGCTGTTAGAGTGGCAGCTC 
[C,T] 

GAGCTGGTTGTGGCACAGTAGCTGGGGAGACCTCAGCAGGGCTGCTCAGTGCCTGCCTCT 
GACAAAATTAAAGCATTGATGGCCTGTGGACCTGCTACAGTGGCCTGGTGCCTCATACTC 
CTCAGGTGCAGGGGCAGGGACAAGAGAAGGGGGAAGTAACCCCATCAGGGAGGAGTGGAG 
GGTGCCTGAGCCGCCATGTGGGCATTGGGGGAGTGATGGGAATGCCAGCAGTGATGACGT 
TGACTACTGACTGAGCACCCACTACTATGACTGAGCACTCACTCGCTAGATACTATCTTG 

23042 GCCGGGCGTGGTGGCTCACGCCTGTAATCCCAGCACTTTAGGAGGCCAAGGC AGGTGGAT 

CACAAGGTCAGGAGTTTGAGATCAGCCTGGCCAACATGGTGAAACTCCATCTTTACTAAA 
AATACAAAAAATTAGCCAGGCATGGTGTTGCATGCCTGCATGCCTGTAATCCCAGTTACT 
TGGGAAGCTGAGGCAGGAGAATTGCTTGAACCCTGGAGGCGGAGGTTGTAGTGAGCCGAG 
ATCACGCCATTGCACTCCAGCTTGGGCAAGAAGAGAAACACTCTCAAAAAAAAAAAAAAA 
[A,-,T] 

CAGGAAACTGGTGCTCAAAAAGGAAAAGTGACTCACCAAGGTCACAGACTAGGCAGTGAT 
GCTGGGGGAACCTGGCTCAGGGGACACAGACCTGGCCTGGGGCAGCCTTGCAGCTCCTCC 
ACTAAAATACTGAAAATGAGGGGCTTCGATGATGGTTATAATCGTATGGCAGAGCCCCAA 
CTCAACTGGAGCCCTGGGACCCAGAAGCTAGGGTCTCACTCCCTGCTTTTCCACAAGGCA 
CCATTAGGGCATCACCCCAGGCCTCGGCAGCCACGACGCAGGGATCCTGCCTCTCATTGG 

23344 AGGAAACTGGTGCTCAAAAAGGAAAAGTGACTCACCAAGGTCACAGACTAGGCAGTGATG 
CTGGGGGAACCTGGCTCAGGGGACACAGACCTGGCCTGGGGCAGCCTTGCAGCTCCTCCA 
CTAAAATACTGAAAATGAGGGGCTTCGATGATGGTTATAATCGTATGGCAGAGCCCCAAC 
TCAACTGGAGCCCTGGGACCCAGAAGCTAGGGTCTCACTCCCTGCTTTTCCACAAGGCAC 
CATTAGGGCATCACCCCAGGCCTCGGCAGCCACGACGCAGGGATCCTGCCTCTCATTGGT 
[T,C] 

GGGGGCTTAGGGGCTCTGGGCTGCCCTCTTGAAGAGGGGGTTCAGCCCAGCGAGGCACCC 
CCTATGCTGCACCCCACCAAGGTTAGGAAGAGGTCCTGTCCTCAGTGGGGCCCTCTGATG 
AACAGCCCATCAGGTCTGCGTCCACATGCCTTGGAAGAGATGGTGACATACTCAAAGTCC 
TTGAAGCCGCATATTAAACCACCTAGAGCACCATCTTCAAACATTTAGGGTCTGAGAAGA 
TAGGGGAAGTAAGCAATTTAAAACATTTCTTTATATTGGGCCAGGTGCAATGGCTCACGT 

23873 GGTCTGAGAAGATAGGGGAAGTAAGCAATTTAAAACATTTCTTTATATTGGGCCAGGTGC 
AATGGCTCACGTCTGTAATCCCAGCGCTTTGGGAGGACGAGGATCACCTGAGGTCAGGAG 
TTCAAGATCAGCCTGGCCAACATGGAGAAACCCCATCTCTACTAAAAATACAAAAATTAG 
CTCAGGCGTGGTGATGTGCACCTGTAATCCTAGCTATTCAGGAGGCTGAGGCACAAGAAT 

TGCTTGAGTCAATATTGCACCACTGCACTCCAGCCTGGGCAACAGCGAGACTCTTGTCTC 
[A,-] 

AAAAAAAAAAAAGATATTTGCTGAAAAGACCCAGCCTGCCAAACTCAGGGGCAGCCAAGG 
GAGGTAGTGAAATGGAAGTTGGAGCTCAGCGCTCCCACACCTCCACTGCCCTCAGGCCTT 
CTCTGCCTCTTTCCCATCAGTCAGCTGCTTCTGGGCATGGTCCTGGCAGAGACTTGGCCT 
CCTTCCAGTTCAAGCTCCCTCTTAGATTGTGTCCCACGCCACTGAGTCTTTGGGACACTG 
GGTCAGATGTCTAGTCTGGCACAATTGGCAGGAATCCCAAGAAACAGTGTGAGTGAGGGG 

247 64 ATAGCTGGGATTACAGGTGTGCACCACCATGCCCAGCCTAATTTTTGTATTATTAGTAGA 
GATGGGGTTTCACCATGTTGTCCAGGCTGGTCATGAACTCCTGACCTCAAGTGATCCACC 
CGCTTTGGCCTCCCAAAGTGCTGGGATTACAAGCATGAGCCACAGTGCCTGGCCTGACCC 
TGCTCTTTTGAAAGACCATTCCCCCAAATTCTGTGCACCTGTGTGCCTTTCTTCTCTCTG 
CCTCCTCTCAGCTCTGCCCCGCTCTCCTCCCTTCTCCTCTGGCAAATCCCACTCATCTCT 
[G,T] 

GAAGCCCTTCTTCCAGGGGAAGCCCTGATCATGCTGCTTTCTCCTGTGGGAGGGATGAAG 
GACGTGGCCCACGGAGTTTGTTTTGTTTTGTTTTGAGATGGAGTTTTGCTCATGTTGCCC 
AGGCTGGGGTACAATGGTACGATCTCAGCTCACTGCAACCTCTACGTCCCGGGTTCAAGC 
GGTTCTCCTGCCTTAGCCTCCCCAGTAGCTGGGATTACTGGCATGAACCACCACACCTGG 
CTAATTTTGTGTTTTTAGTAGAGATGGGGTTTCTTCATGTTGGTCAGGCTGGTCTCGAAC 

24939 GACCCTGCTCTTTTGAAAGACCATTCCCCCAAATTCTGTGCACCTGTGTGCCTTTCTTCT 
CTCTGCCTCCTCTCAGCTCTGCCCCGCTCTCCTCCCTTCTCCTCTGGCAAATCCCACTCA 
TCTCTTGAAGCCCTTCTTCCAGGGGAAGCCCTGATCATGCTGCTTTCTCCTGTGGGAGGG 
ATGAAGGACGTGGCCCACGGAGTTTGTTTTGTTTTGTTTTGAGATGGAGTTTTGCTCATG 

TTGCCCAGGCTGGGGTACAATGGTACGATCTCAGCTCACTGCAACCTCTACGTCCCGGGT 
[T,C] 

CAAGCGGTTCTCCTGCCTTAGCCTCCCCAGTAGCTGGGATTACTGGCATGAACCACCACA 
CCTGGCTAATTTTGTGTTTTTAGTAGAGATGGGGTTTCTTCATGTTGGTCAGGCTGGTCT 
CGAACTCCCAACCTCAGGTGATCTGCCTGCCTCGGCCTCCCAAAGTACTGGGATTACAGG 
GTTGAGCCACTGTGCCTGGCCCAGGCCCACGGAGTTTTAAGAGGCTTCCTGTGGCAGTGG 
CATCCAGACGGAGTGCAGAAACTCAAAGTTGAAGGCCAGAAGCTCAGGGAAGGGGGAGTG 

24945 GCTCTTTTGAAAGACC ATTCCCCCAAATTCTGTGC ACCTGTGTGCCTTTCTTCTCTCTGC 

CTCCTCTCAGCTCTGCCCCGCTCTCCTCCCTTCTCCTCTGGCAAATCCCACTCATCTCTT 
GAAGCCCTTCTTCCAGGGGAAGCCCTGATCATGCTGCTTTCTCCTGTGGGAGGGATGAAG 
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GACGTGGCCCACGGAGTTTGTTTTGTTTTGTTTTGAGATGGAGTTTTGCTCATGTTGCCC 
AGGCTGGGGTACAATGGTACGATCTCAGCTCACTGCAACCTCTACGTCCCGGGTTCAAGC 
[G,A] 

GTTCTCCTGCCTTAGCCTCCCCAGTAGCTGGGATTACTGGCATGAACCACCACACCTGGC 
TAATTTTGTGTTTTTAGTAGAGATGGGGTTTCTTCATGTTGGTCAGGCTGGTCTCGAACT 
CCCAACCTCAGGTGATCTGCCTGCCTCGGCCTCCCAAAGTACTGGGATTACAGGGTTGAG 
CCACTGTGCCTGGCCCAGGCCCACGGAGTTTTAAGAGGCTTCCTGTGGCAGTGGCATCCA 
GACGGAGTGCAGAAACTCAAAGTTGAAGGCCAGAAGCTCAGGGAAGGGGGAGTGTGAGTT 

25092 ATCATGCTGCTTTCTCCTGTGGGAGGGATGAAGGACGTGGCCCACGGAGTTTGTTTTGTT 
TTGTTTTGAGATGGAGTTTTGCTCATGTTGCCCAGGCTGGGGTACAATGGTACGATCTCA 
GCTCACTGCAACCTCTACGTCCCGGGTTCAAGCGGTTCTCCTGCCTTAGCCTCCCCAGTA 
GCTGGGATTACTGGCATGAACCACCACACCTGGCTAATTTTGTGTTTTTAGTAGAGATGG 
GGTTTCTTCATGTTGGTCAGGCTGGTCTCGAACTCCCAACCTCAGGTGATCTGCCTGCCT 
[C,T] 

GGCCTCCCAAAGTACTGGGATTACAGGGTTGAGCCACTGTGCCTGGCCCAGGCCCACGGA 
GTTTTAAGAGGCTTCCTGTGGCAGTGGCATCCAGACGGAGTGCAGAAACTCAAAGTTGAA 
GGCCAGAAGCTCAGGGAAGGGGGAGTGTGAGTTGAGGAGTCTCTTGGCTGCCAGGGCCAG 
AAACCGAACTCCAAGCCTCTCCACAACAGCGGGTGTAGAGCATGTAGAATCAGAGAGGAG 
GCTGAGCCATGCAGCCCCGAGAAGAGGGGAATGCCACTGAGCCACAGAGACCCAGTGCCA 

25428 AGTGCAGAAACTCAAAGTTGAAGGCCAGAAGCTCAGGGAAGGGGGAGTGTGAGTTGAGGA 
GTCTCTTGGCTGCCAGGGCCAGAAACCGAACTCCAAGCCTCTCCACAACAGCGGGTGTAG 
AGCATGTAGAATCAGAGAGGAGGCTGAGCCATGCAGCCCCGAGAAGAGGGGAATGCCACT 
GAGCCACAGAGACCCAGTGCCACTGCCAGGTGTCTCTGCCTCCACTTCCCATGACCC 
[T,G] 

GCCTGTCTCTGTATGCAGGCTTCACCCTCTCTCGTTGTACATTGTACACATTCTAGGTGA 
CACCAGCAGCTTCTGATTCTCATCTCCCATAACATCAGCCCCCCAGAGAGGGGACAACTG 
CTGAGCTGATAACATAATAGATGCCCCTTTCCTGGAGGCCATGGTCATGGTCAGCGTGGA 
GAGGATGAAGCCTGAGCAGGCAGGATCGGGGGTCTAGAGGGGAAGGAGGTGGAAGTT 

25513 GGCCAGAAGCTCAGGGAAGGGGGAGTGTGAGTTGAGGAGTCTCTTGGCTGCCAGGGCCAG 
AAACCGAACTCCAAGCCTCTCCACAACAGCGGGTGTAGAGCATGTAGAATCAGAGAGGAG 
GCTGAGCCATGCAGCCCCGAGAAGAGGGGAATGCCACTGAGCCACAGAGACCCAGTGCCA 
CTGCCAGGTGTCTCTGCCTCCACTTCCCATGACCCGGCCTGTCTCTGTATGCAGGCTTCA 

CCCTCTCTCGTTGTACATTGTACACATTCTAGGTGACACCAGCAGCTTCTGATTCTCATC 
[C,T] 

CCCATAACATCAGCCCCCCAGAGAGGGGACAACTGCTGAGCTGATAACATAATAGATGCC 
CCTTTCCTGGAGGCCATGGTCATGGTCAGCGTGGAGAGGATGAAGCCTGAGCAGGCAGGA 
TCGGGGGTCTAGAGGGGAAGGAGGTGGAAGTTGAGATCACAGACCTGTGGTCAGGTGGCC 
TGGGAAGGGTTTGACGAGTGTCGGCCCAAAGAGCTTGGAAGGGATTTTGCTGCTGTGGGT 
GAGCACTGCCTCTCCCCTTAGGGACAACAGCCACCTCTTCTCTCCCCATTTGCCTTTCCC 

25684 CCAGTGCCACTGCCAGGTGTCTCTGCCTCCACTTCCCATGACCCGGCCTGTCTCTGTATG 
CAGGCTTCACCCTCTCTCGTTGTACATTGTACACATTCTAGGTGACACCAGCAGCTTCTG 
ATTCTCATCTCCCATAACATCAGCCCCCCAGAGAGGGGACAACTGCTGAGCTGATAACAT 
AATAGATGCCCCTTTCCTGGAGGCCATGGTCATGGTCAGCGTGGAGAGGATGAAGCCTGA 
GCAGGCAGGATCGGGGGTCTAGAGGGGAAGGAGGTGGAAGTTGAGATCACAGACCTGTGG 
[C,T] 

CAGGTGGCCTGGGAAGGGTTTGACGAGTGTCGGCCCAAAGAGCTTGGAAGGGATTTTGCT 
GCTGTGGGTGAGCACTGCCTCTCCCCTTAGGGACAACAGCCACCTCTTCTCTCCCCATTT 
GCCTTTCCCTTCTGTAGATATGAAACACAGGCCTCCTTGTCAGGCCCCTACTTAACCTCC 
GTGATGGGGAAAGCGGCCGGAGAAAGGGAGTTCTGGCAGCTTCTCCGAGACCCCAACACC 
CCACTGTTGCAAGGTGAGTCATGGCCTGACACTCTGGATGTGTCCCCTACCCCAAGCTTA 

26165 GTGATGGGGAAAGCGGCCGGAGAAAGGGAGTTCTGGCAGCTTCTCCGAGACCCCAACACC 
CCACTGTTGCAAGGTGAGTCATGGCCTGACACTCTGGATGTGTCCCCTACCCCAAGCTTA 
CTCAGCCAAGAGGCTTCATCAACTCACCCCAGCTTTCCCTAGCACCCTCCTGGGCCACAC 
CTTCACAAAATCACTGATGCTCAAAGTTGGATATAATATATTGAACTGAAGCCTTAGCAT 

TTTTATGCAAGTTACTGTGGAAATTCTAGGAAACCAGACAGATTACAAAAAAAAAAAAAA 
[A,-] 

CTAGAAGAAAATTAACATCACCTAGGATATACTACCTAGGAATAACGTCTTTTATTTTGA 
GATGGAGTTTCGCTCTTGTTGCCCAGGCTGGAGTGCAGCGGTATGATCTCGGCTCGCTGC 
AACCTCCGCCTCCTGGGTTCATGTGATTCTTCCACCTCGGCCTTCCTAGAGCCCAAGTGG 
TCTGCCTGCCTCTGCCTCCCAAAGTTCTGGGATTACAGGCATGAGCCACCGCACCCAGCC 
AAAATTACTTAACTTTTCTTCTAGATACTTTTTAAAAATATGGCAGTAAGTTTTTCATAA 
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Isoform 2; 
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Exon : 


2132-2195 

-i- J, ^/ J- „s *J 


Intron : 


C. _L J \J JU/U 


Exon : 
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Sft fi4-7 67? 




7fi71-77fi1 


Intron ■ 


77 62-914 9 


Exon: 


9150-9302 


Intron : 


9303-10101 


Exon : 


10102-10274 


Intron : 


10275-10399 


Exon: 


10400-10586 


Intron: 


10587-12128 


Exon : 


12129-12294 


Intron: 


12295-25922 


Exon : 


25923-26040 



AJLXeJ-lC 


Variants 


(SNPs) : 




DNA 








Position 


Maj or 


Minor 


Domain 


1022 


C 


T 


Beyond ORF(5 * ) 


1882 


C 


T 


Beyond ORF ( 5 1 ) 


1951 


G 


A 


Beyond ORF(5' ) 


2940 


A 


G 


Intron 


3831 


G 


A 


Intron 


6732 


G 


A 


Intron 


7558 


\j 


A 


Intron 


7931 


T 


A 


Intron 


8190 


T 


C 


Intron 


8652 


C 


T 


Intron 


9370 


G 


C 


Intron 


9463 


C 


T 


Intron 


9883 


G 


T 


Intron 


11594 


G 


A T 


Intron 


12361 


A 


G 


Intron 


13187 


T 


C 


Intron 


13284 


T 


c 


Intron 


21341 


C 


G 


Intron 


21796 


A 


G 


Intron 


22159 


C 


T 


Intron 


22334 


C 


A 


Intron 


22346 


C 




Intron 


22476 


C 


T 


Intron 


23143 


A 


- T 


Intron 


23445 


T 


C 


Intron 


23974 


A 




Intron 


24865 


G 


T 


Intron 


25040 


T 


C 


Intron 


25046 


G 


A 


Intron 


25193 


C 


T 


Intron 


25529 


T 


G 


Intron 


25614 


C 


T 


Intron 


25785 


C 


T 


Intron 


26266 


A 




Beyond ORF (3' ) 



Protein 

Position Major 



Minor 



Context: 
DNA 

Position 



1022 TTGGAGAT ATTTT AAGGTCAT AGTGTCTTCACAAATTGAGCTGAAAGGGAACTGTTAGGA 

TGATCTTGCCTAACCCTCTCATCTCACACAGG7AAGAACTATTTTAAACTCGAGAGGTTAA 
GTGACCTGGCCAAAGTCACACAGCCACCACTAGTTAACTCGTATACATTGATTCTCCTGT 
GGGGCTGGGCAGATGAGGAATCTTTTGTTCTCTTCCCTGTTTGCAGAGATTTTTTTTGAG 
GTTACTTTCCGAGTTCTGGCAAGTACCCCTGCTTCTGGTAGCTTTGTGTCTCGATTC7AAT 
[C,T] 

TCATTCTTTTTATTTTATTTTATTTTTGAGACAGGGTCTCACTTTGTCACCCAAGCTGGA 
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GTGCAGTGGTGTAATCTTGGCTCACTGTAGCCTCCACCTCTTGGGTTCAAGCGATCCTCC 
TGCCTCAGCCCCCCAAGTAGCTGGGATTACAGACGTCTGCCACCACGCCAGGCTAATTTA 
TGGTTTTTTGTATGTGTTTTTTGTGTTTTTGTAGAGACAGTGTTTCCCCATGTTGCCCAG 
GCTGGTCTCCAACTCCTGAGCTCAAGTGATCTGCCCGCCTCAGCCTTTCAAAGTGCTAGG 

1882 ACAGCTGACTCCAGCAATGCTGCTCACGTGACCACTGCAGCTGCAGCTCCCGTTCCACTC 
CTTGTCCTGGGCTAGGTGGGCACTACCAGGGGCTCCTTTGGTAAGGAGTACCGGGTAGGC 
ACCCGGTCCTGCCAATCCACCACTGGAACAGCTGGGGGGACAGCAGACAGGCACGGTCGG 
ACAGACTTGACAGATCAGGCATCAGGCCCTCTGCGCTGGTCCCGGGCTCTTTAAGCAGGA 
ACGTGAATGGCCTCAAGATGTCTCACATGGTCCCACTAGCCCTCCTCCTCCCTTTGTTCC 
[C,T] 

TACCTCCAGGAGGGCTGCTCTGCCCTTCCTTCCTCTGTTCTTTGGCCTTATGTTCCCCGC 
CACCACAGGCCTTCCCCCGCCCCACCCCTCTGCAGACTTAGCCGTGCATTGCAGGCATGG 
AGGATTAATCAGTGACAGGAAGCTGCGTCTCTCGGAGCGGTGACCAGCTGTGGTCAGGAG 
AGCCTCAGCAGGGCCAGCCCCAGGAGTCTTTCCCGATTCTTGCTCACTGCTCACCCACCT 
GCTGCTGCCATGAGGCACCTTGGGGCCTTCCTCTTCCTTCTGGGGGTCCTGGGGGCCCTC 

1951 GGCTAGGTGGGCACTACCAGGGGCTCCTTTGGTAAGGAGTACCGGGTAGGCACCCGGTCC 
TGCCAATCCACCACTGGAACAGCTGGGGGGACAGCAGACAGGCACGGTCGGACAGACTTG 
ACAGATCAGGCATCAGGCCCTCTGCGCTGGTCCCGGGCTCTTTAAGCAGGAACGTGAATG 
GCCTCAAGATGTCTCACATGGTCCCACTAGCCCTCCTCCTCCCTTTGTTCCCTACCTCCA 
GGAGGGCTGCTCTGCCCTTCCTTCCTCTGTTCTTTGGCCTTATGTTCCCCGCCACCACAG 
[G,A] 

CCTTCCCCCGCCCCACCCCTCTGCAGACTTAGCCGTGCATTGCAGGCATGGAGGATTAAT 
CAGTGACAGGAAGCTGCGTCTCTCGGAGCGGTGACCAGCTGTGGTCAGGAGAGCCTCAGC 
AGGGCCAGCCCCAGGAGTCTTTCCCGATTCTTGCTCACTGCTCACCCACCTGCTGCTGCC 
ATGAGGCACCTTGGGGCCTTCCTCTTCCTTCTGGGGGTCCTGGGGGCCCTCACTGAGATG 
TGTGGTGAGTAACTCGCCTCTATCCTGTGCCTCTTTCCTCCTGGGTCCTTAGTGGGGTGG 

2 940 AACATAGGGAGACCCCATCTCTACAAAAAATAAAAAAAATTAAAAAATAGCTGGGCATGG 
GGAAGACTTTCTGAAGACCAAGAGGACACATGGGAGCTGAAACTCGAAGGAAGAAAAGGA 
GCTGGCAGGAAAGGAGTGGGGGACACACATTCTAGGCAGCAGGAAGTGAGCCTTCGGAGG 
TCCTGCCTGCTCCAGCTCTGTGCCCCAAGGGGTCTCTTGGAGCACAGTCTCCTGGGACCT 
GTCTATGAGTCTGAGCTTAGAGGCTCAGGGCTGCTCCTTCAGACAGGAGGCAGAAGGCAG 
[A,G] 

CTTTGGGAACTTTGGGCCGCCCACGCGCCTTTTCTCCTCCTCTGCACCTAGGATTACGTT 
GAGCAATACACTTTCACCCCCATGGTCTCTTGAGACCCTGGGGAAACCCTGAGAGGTGGG 
TGCAGTCATGTCCAGGTGTCAAGTGAAGAAGTCGAGGGTTGGAGGGGCTGAGTGACCCAC 
TCAGGGTGCTCCACCTTTTCCAGAGCTTTGCTGAACTTAGTTTTTAGAACTTGAAGCCTC 
GTTTGTTTTCGTTTTGTTTTTTGTTGAGAGAGGTTCTCCCTCTGTTGCCCAGGCTGGAGT 

3831 GACACCTCAGGTCTGGGCCCAGGAACCCCAGCTCTTGGTTCATGTCCGGACAGTCCCCAG 
GGGAGTTCTGGGTTCAACCAGCAAGAGCTCTTCCTCCTGGCTGATCTGGTCCTCAGCCTT 
GGACAGTTAGTCCATTAACCTGACCCCACAGGAGCCCCAATCCCTTGGGGTCTGGGGAAT 
CTTGAACTGGGGTTTGGGGTGCAAATATCTGCACTGAGTCACTTAATTGCACCCAGCCTC 
ATTCCTTTATCTGTAAAGTGGGCTAAGAATGCTCCCCTGCCTTCCTCCTCGGTGTAGTAC 
[G,A] 

AGGAAGGATCCCATGACACCTGCTCTCCCAGTTTAAAGCTCTATATGTATGTTGTGAAAT 
TGACAGGGATCGCTGCACAAACGCTAATGCAAAGTGGGCTCCTGTGCTTCCTTTTCTCTT 
TCTTCTTCTTTTTTTTTTTTTTAATTTTCTTCTAGAGATGAGGTCTCACTATATTGCCCA 
GGGTTGGTTTCAAACTCCTAGGGTCAAGCGATCCTCCCACCTTGGCCTCCCAAACTGCTG 
GTATTACAGGCGTGAGCCACTCTGTCTGGCTCCTATGCTTGTGAATGTCAACAGCAATCA 

6732 TGAATAGCTGGGATTACAGGCGTGTGCCACCATGCCCAGCTGATTTTTGTATTTTTGGTA 
GAGATAGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCCG 
CCTGTCTTGGCTTCCCAAAGTGTTGGGATTATAGGCATGAGCCACTGCACCAATCCAAAA 
GCAGCATCTTTGTGCTCCCTTTTCAAGAGGCATCACAGAGAGGCCTGTTTTGGGGTTTGA 
ATGAGAGGCGAAGAATCAGCCATGGAGTGCCTCTTTCTCAGACTCCCTCTTGAGAAGTGG 
[G, A] 

TGCAGGGGTGGAGAGAAAAGAAGACTAGGCATAGTGGCTCATACCTGTAATCCCAACATT 
TTGGGAGGCTGAGGCAGGAAGATTGCTTGAGCTCAGGAGTTTGAGACCAGCCTAGGCAAC 
ATAGTGAGACCACATCTCTTAAAAAAAAGAAAAAGAAAAAAAATGAGCCAGGTGTAGTGA 
CTCATGCCTGTGGTCCCCACTTCTCCGGAGGCAAAGGTGGGAGGATCTTTTGAGGCTGAG 
AAATCGAGGCTACAGTGAGCCATGGTGGCACCACTGCACTCCAGCCTGGGAGACAGAGAG 

7558 AAGAATCGCTTGAACTCGGGAGGCGGAGGTTGCAGTGAGCTGAGAACATGCCACTGCACT 
CCAGCCTGGGCAACAAGAGCGAAACTCTGTCTCAAAGAAAATAAATAAATAAAATAAAAA 
AATAAAAAAGGAGGGGGCATATGGGTGAAGTATGGACAAAATAGTGGGGCAGGCACAGAT 
GATCTGGACACAGGAGCCCTTGGAGTTTATTCTTGAATCTAACTGTTCATCTTTATTAAA 
TATTTGTGGCATACACCTCACAACAACATAGCCAACACACCTCCTTTTGGAGCTTTTATC 
[ G, A] 
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AAGTTTCCCACTGTTAAGATTTTTTCCCGCTTTGTGATGCGGGTGGGGTGGGTGCTGTAA 
GCAGGCTTACGGGGTGGCAGTTTCTCACAAAGGCATTAACTGGCCTTGTCCTAGGTCTGC 
CTTCAGCGAGGATGACGGTGACTGCCAGGGCAAGCCTTCCATGGGCCAGCTGGCCCTCTA 
CCTGCTCGCTCTCAGAGCCAACTGTGAGTTTGTCAGGGGCCACAAGGGGGACAGGCTGGT 
CTCACAGCTCAAATGGTTCCTGGAGGATGAGAAGAGAGCCATTGGTGAGCAGACACCATC 

7931 GGTGGCAGTTTCTCACAAAGGCATTAACTGGCCTTGTCCTAGGTCTGCCTTCAGCGAGGA 
TGACGGTGACTGCCAGGGCAAGCCTTCCATGGGCCAGCTGGCCCTCTACCTGCTCGCTCT 
CAGAGCCAACTGTGAGTTTGTCAGGGGCCACAAGGGGGACAGGCTGGTCTCACAGCTCAA 
ATGGTTCCTGGAGGATGAGAAGAGAGCCATTGGTGAGCAGACACCATCCGCTGGGGGTGG 
GGAGCAGCTGGGAGGGCTCATCAGATGATATTCTCCAATGAGAATCAGAACTTTGGGTTT 
[T,A] 

CTCCCCAGGCGTCTTTCCCACCATCCATTCTGCCCATCTCACTGCCTACGTAGAGGCTCG 
AACCTGTCCCCATAGCCATCCTTGACCCAGCTTTTCCCGCGCTGCACACATACTATTGAC 
AGGTGTGTTTCGTGGTTTTTTGTTTTTTGTTTGTTTGTTTGTTTTGAGTTGGAGGTTTGC 
TCTTGCTGCCCAGGCTGGAGTACAATGGCGCAATCTCAGCTCACCGCAATCTCTGCCTCC 
TGGGTTCAAGCAATTCTCTTGCCTCAGCCTCCTGAGTAGCTGGGATTACAGGCATGCGCC 

8190 ATCAGATGATATTCTCCAATGAGAATCAGAACTTTGGGTTTTCTCCCCAGGCGTCTTTCC 
CACCATCCATTCTGCCCATCTCACTGCCTACGTAGAGGCTCGAACCTGTCCCCATAGCCA 
TCCTTGACCCAGCTTTTCCCGCGCTGCACACATACTATTGACAGGTGTGTTTCGTGGTTT 
TTTGTTTTTTGTTTGTTTGTTTGTTTTGAGTTGGAGGTTTGCTCTTGCTGCCCAGGCTGG 
AGTACAATGGCGCAATCTCAGCTCACCGCAATCTCTGCCTCCTGGGTTCAAGCAATTCTC 
[T,C] 

TGCCTCAGCCTCCTGAGTAGCTGGGATTACAGGCATGCGCCACCACACCCAGCTAATTTT 
GTATTTTTAGTAGACGTGGGGTTTCTCCATGTTGGTCAGGCTGGTCTCGAACTCCTGACC 
TCAGGTGATCCGCTTGCCTTAGCCTCCGAAAGTGCTGGGATTACAGGCATGAGCCACTGC 
GTTAGGCCCACTGACAAGCCTTGTATTGGCTAGCCACCAAGATTGACTTGATTATCCACC 
TTCGGGACAACTGGACAGCCTGCTTATGACTTACGCCATAGTCTGTCTCTACTAGCTCTC 

8652 TACAGGCATGAGCCACTGCGTTAGGCCCACTGACAAGCCTTGTATTGGCTAGCCACCAAG 
ATTGACTTGATTATCCACCTTCGGGACAACTGGACAGCCTGCTTATGACTTACGCCATAG 
TCTGTCTCTACTAGCTCTCCTGCCCTGACTTGACCCAGCATACAACAGCCAGAGCCAGCC 
TTTTCAATATAAACCTGATCTTGCTGGCACTGCTTAAACCCTGCAGGGGCCTCGCACTGC 
TCCATGGCCCAGCCTGTCTACCCTTACCTTCTGCCCAGGCTCTGCTCATCCATTCTCTGC 
[C,T] 

TCCCACACACCTGCCCTCTGTGGGCTCCAGCCATACCATCTCTCAACTCATAAGCCAGTT 
TTTTCATACAGGCTCCCTCCATCTGGACTGGCTTCCCTGCGTGCAGTTCACTCCTGCTCT 
ACCTTTGGCTCTGCCTCCACCCATCCTCAGCCGTCTCCAGCATTACCTCCTTGGAGAATC 
CTGCCTTGACTTCCCAGCCACCCAAATATCACTACTTGGTCTGCATTCTCGTTGCAATTG 
CAGTCGCATGAGCAATTGCTGTGGTTGAGGCCCGAACTGCGCAAGTGCCTGTCTGCCATG 

9370 AGGCCAGGGTCCCAGGTGCTGGCGGGGCTGGCTGCTGGGTGGGGGCAGAGAGGCAACCCC 
TCTGTTTTTTTCCCTCTCAGGGCATGATCACAAGGGCCACCCCCACACTAGCTACTACCA 
GTATGGCCTGGGCATTCTGGCCCTGTGTCTCCACCAGAAGCGGGTCCATGACAGCGTGGT 
GGACAAACTTCTGTATGCTGTGGAACCTTTCCACCAGGGCCACCATTCTGTGGGTGAGTA 
GGTCAGACCGTGCCAAGGCCAGGCTGGCACTCCCTCAGTCCCCAGGTCTGCACTGATGAC 
[G,C] 

TCCATACCCTGGCCCCCACACTCACCTTTCCTTGGGGCTCCTCCGAATCAAGTCCTTTAG 
GGACGAATTGGCGAGGGCTCATGGGTGATGCTCCAGCTGTGAGCCAGCTTTGGAGCTGGT 
AGGTGGATCTCTTGAGGCCAGGAGTTCAAGACAACGTGGTGAAACCCCATCTCTACTAAA 
AATAAAAAAGTTAGCCGGGCATGGTGGCACATGCCTGTAGTCCCAGCTACTCGGGAGGCT 
GAGGCAGGAGAATCACTTGAACCTGGGAGGCGGAGGCTGCAGTGAGTGGAGATCGCACCA 

94 63 GGGCCACCCCCACACTAGCTACTACCAGTATGGCCTGGGCATTCTGGCCCTGTGTCTCCA 
CCAGAAGCGGGTCCATGACAGCGTGGTGGACAAACTTCTGTATGCTGTGGAACCTTTCCA 
CCAGGGCCACCATTCTGTGGGTGAGTAGGTCAGACCGTGCCAAGGCCAGGCTGGCACTCC 
CTCAGTCCCCAGGTCTGCACTGATGACGTCCATACCCTGGCCCCCACACTCACCTTTCCT 
TGGGGCTCCTCCGAATCAAGTCCTTTAGGGACGAATTGGCGAGGGCTCATGGGTGATGCT 
[C,T] 

CAGCTGTGAGCCAGCTTTGGAGCTGGTAGGTGGATCTCTTGAGGCCAGGAGTTCAAGACA 
ACGTGGTGAAACCCCATCTCTACTAAAAATAAAAAAGTTAGCCGGGCATGGTGGCACATG 
CCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCACTTGAACCTGGGAGGCGG 
AGGCTGCAGTGAGTGGAGATCGCACCACTGCCCTCCAGCCTGGGCAACAGAGTGAGTGAG 
AC T C T G T C T C A AA AAAT A AAA AAT A AAAT AAAAC T C C C C TAG T GAT T C C A A T GT GC AGC T 

9883 GCCTGTAGTCCC AGCTACTCGGGAGGCTGAGGCAGGAGAATCACTTGAACCTGGGAGGCG 

GAGGCTGCAGTGAGTGGAGATCGCACCACTGCCCTCCAGCCTGGGCAACAGAGTGAGTGA 
GACTCTGTCTCAAAAAATAAAAAATAAAATAAAACTCCCCTAGTGATTCCAATGTGCAGC 
TAAGTTTGGAAATAGGTGGTATGGGGTCAAGTCCTCTTGGGCCTCCCTCCTCCAGTCCTT 
CTCCCTAACCTCTAGCCCTCAAGTTGCAGAGTGATCAGCCAAACCAGTTTGCCCAGAAAT 
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EG/ T] 

AGCAGTTTCCTGGGACACAGGATTTTCAGAGTCCAGACAAGGAAAGTCTTGGGCAGACCA 
GGTTGAGTTGGTGCCCTTAGCTGATCTGACCATGTTGCCCTTCTTCTCCAAGCCCTCCTG 
TGGTTGTCCATAGCTACAAGGGCCTGACCCTCAAGCCCCTGCCTGTCCTGGCCCCTTTGG 
CTCTCCAGCTCATTGCATGTTCTGTCCCCCACTTCAAGACACAGCAGCCATGGCAGGCTT 
GGCATTCACCTGTCTGAAGCGCTCAAACTTCAACCCTGGTCGGAGACAACGGATCACCAT 

11594 AAAAAAAAAAATGGAGAAGAAGGAAGCTGGACATGGTGGCTCGTGCTTATAATCCTAGCA 
CTCTGGGAAGCTGAGGCAGATGGATTGCCTGAGCCCAGGAGTTTGAGACCAGCCTGGGCA 
ACATGGTGAAACCCTGTCTTTACTAAAATACGAAAGATTAGCCAGGCATGGTGGTAGACA 
CCTATAATCCCAGCTACTAGGGAGGCTGAGCCACAAGAATCACTTGAACCTGGGAGACAG 
AGGTTGCAGTGAGCCGAGATCGCGCCATTGCACTCCAGCCTGGGCGACAGTGTGAGACTC 
[G,A,T] 

GTCTCCAGAAAAAACAAGAATGGATAGAGTGGAGCCAAGAAGAGGCAGGAAGAACAAAGA 
CACAGAGGTGCACAGAGTTTGGGGGAATTTTGAGGAATGGTCTTGCAAAAGAGTGGGATC 
TGGGAGAATGAGTGGGAGTGGAAAGCAGATGAATGAAGAGAAGGTGAGCGCATCAGGGTA 
ACAGAGATGCGTTGTGAACAAATGCATGTTCTAGGAAGAGCCCTCTGGAGTGCTAGGTGC 
CAGAGAGGTGGGAGGAAGGATACTGGAAGCAGAGAAACCAGTGAGGGGCCTGATCTTGGG 

12361 ACAAGAAGACAAATAATCCAGGCTCTCTGTCCTCACACCAGCTGCCCGCCCCTTTCTTCC 
TGGCACAGTCATGTTGGAACCAGCTGCTGAGACCATTCCTCAGACCCAAGAGATCATCAG 
TGTCACGCTGCAGGTGCTTAGTCTCTTGCCGCCGTACAGACAGTCCATCTCTGTTCTGGC 
CGGGTCCACCGTGGAAGATGTCCTGAAGAAGGCCCATGAGTTAGGAGGATTCACGTGAGA 
CTCCCACCTCCCAGTCCTCACCCCACCCAACCTCACATGCCTGATAACAGGGTCACAGAA 
[A,G] 

AGACGGGGAACAGAGGAGAGGGTTCCCTCGGGAGAGACACTGGCCCTGCTTCTGCTTCTA 
CCTGCTCAGCTCCTTTCTTGCCCACGGTGTTATGGAAACAGGGAGCCATAGGCCAGCATT 
GTCACTGAGAGAGCAGGCTTTGGAGGCAGAGCCCCCCAGTTGGAATCCCAACTCTAACCA 
GCTAGGTTCCAGGTAGGCACCCACAATTCACCGAGGAGAACAGTTGTGCCCCTTCCCTGC 
AGGGCCAGTGTGAAGAGTCCAGGAGTTAGTACACATAGAGATAGTGGCATGTGCTTTTTA 

13187 GGCACGTGCCACACAACCCTGGGTAATTTTTTTTTTTTTTTTTTTGAGATAGGGTCTCTG 
TCTGTTGCCCAGGCTGGTCTCAAATTCCTGGCCTCAAACCATCCTCACACCTGAGGCGCT 
CAAAATATTGGGATTATAGGTGCGAGCCATCATGCTCAGCCAGAATAATAACTGGTTTTT 
TTTGTTTTTTTTTTGAGACAGAGTCTCACTCTATTACCCAGGCTCTGGAGGCCCAACTCG 
TGTTTGTGTATTTGTTTATTTTTATTTATTTATTTATTTCGAGACAGAGCCTCTCTCTTT 
[T,C] 

ACCTAGGCTGGAGTGCAGTGGCGCAATCTCGGCTCACTGCAACCTCCGTCTCCTGGGTTC 
AAGTGATTGTCCTGCCTCAGCCTCCTGAGTAGCTGGTGCTACAGGCGCGTGCCACCATGC 
CCAGCTAATTTTTGTATTTTTAGTAGAGACAGGGTTTTACTATGTTGGCCAGCTGGTTTC 
TAACTCCTGAACTCGGGTGATCTGCCTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGC 
ATGGGCCTCCGTGCCCGGCCATGTATTTATTTAGGCAAGGTCTCTCTCTGTTATCCAGGC 

132 84 ACCATCCTCACACCTGAGGCGCTCAAAATATTGGGATTATAGGTGCGAGCCATCATGCTC 
AGCCAGAATAATAACTGGTTTTTTTTGTTTTTTTTTTGAGACAGAGTCTCACTCTATTAC 
CCAGGCTCTGGAGGCCCAACTCGTGTTTGTGTATTTGTTTATTTTTATTTATTTATTTAT 
TTCGAGACAGAGCCTCTCTCTTTCACCTAGGCTGGAGTGCAGTGGCGCAATCTCGGCTCA 
CTGCAACCTCCGTCTCCTGGGTTCAAGTGATTGTCCTGCCTCAGCCTCCTGAGTAGCTGG 
[T,C] 

GCTACAGGCGCGTGCCACCATGCCCAGCTAATTTTTGTATTTTTAGTAGAGACAGGGTTT 
TACTATGTTGGCCAGCTGGTTTCTAACTCCTGAACTCGGGTGATCTGCCTGCCTCGGCCT 
CCCAAAGTGCTGGGATTACAGGCATGGGCCTCCGTGCCCGGCCATGTATTTATTTAGGCA 
AGGTCTCTCTCTGTTATCCAGGCTGAAGTGCAGTGGCACATTCATAGCTCACTGCAGCCT 
CAAATTATCCAAGTAACAGGGACTACAGGCATGCACCACCACACCCATCTACTTTTTTTT 

21341 TCAGCTCACTGCAACCTCTGCCTCCCTGGCTTAAGCGATCCTCCCACCTCAGCCTCCTGA 
GTACGTGTGACCATAGGCCCATGGCACAAAGCCCAGCTAATTTTTTGTATTTTTAGTAGA 
AATGTGGTTTCACCATGTTGCATAGGCTGGTCTCGAACTTCTGAACTCAAGTGATCTGCC 
TGCCTTGGCCTCCCAAAGTGCTGGGATTCTAGGTATGAGCCACCCTGCTCGGCCTATAAT 
GGCACTTTCCTATCCCATTGATGAGGCTCTACTCTCATGACCTAATCATCTCCCAAAGGC 
[C,G] 

CTAAGGCCTCCTGATACCATCACCTTTGGGGTTAGGTTTTAACATATACATTTTGGGGGG 
ACACAGACATTTTAGACCATAGCACCTCCATTGAAAGGAAACATTTCTGACACCTGGCTA 
TCTCAAAGGGCCCTTTCAGTTCCCCTGCAGGCTGCATTCCCACATCACCAACAAGAGCAG 
CGACACTCACTCAGAGGTTAAATAACTTGTCCAGAGTCACAGCAGTAATGAATGACAGAG 
CTGGGGCTTGAATCCAGGCGTCCTCCTAGAGCCTGGATTCTGTGTAGTGAGTGAAAGCTG 

21796 C ATTCCCAC ATCACCAACAAGAGCAGCGAC ACTCACTCAGAGGTTAAAT AACTTGTCC AG 

AGTCACAGCAGTAATGAATGACAGAGCTGGGGCTTGAATCCAGGCGTCCTCCTAGAGCCT 
GGATTCTGTGTAGTGAGTGAAAGCTGACTCCTGGGAGACTTCTGCGTGGTCCTGGTTCTC 
TCTCCAGACTGCACTGCGCAAGTTTCTCTTCCTGATGGTCCCTAGGGTATTACAAAGACA 
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GTGGCCCTGCCTGTCAGGTGTTTTTATTACCAGATGAGGTCATGGCCTCAGGAACCCTGT 
[A,G] 

GGAAGCTGAGTTCAGAGTCTTTGAGCAGGCTTTAGGGAGGTTCCAGCTTCCCACCACCAA 
GCCCCAGGTGGATTCTTACAGACTCTAGCCTCAGGGTGGGGGGTCTGGAAGATGAGGTTG 
CGGGGTGCGATATTCTGCCCAATTCGCCCCTCCTTGCTCAATCTGTTTCTGCAGGTATTG 
CTGACTACAGACCCAAGGATGGAGAAACCATTGAGCTGAGGCTGGTTAGCTGGTAGCCCC 
TGAGCTCCCTCATCCCAGCAGCCTCGCACACTCCCTAGGCTTCTACCCTCCCTCCTGATG 

22159 CCCAGGTGGATTCTTACAGACTCTAGCCTCAGGGTGGGGGGTCTGGAAGATGAGGTTGCG 
GGGTGCGATATTCTGCCCAATTCGCCCCTCCTTGCTCAATCTGTTTCTGCAGGTATTGCT 
GACTACAGACCCAAGGATGGAGAAACCATTGAGCTGAGGCTGGTTAGCTGGTAGCCCCTG 
AGCTCCCTCATCCCAGCAGCCTCGCACACTCCCTAGGCTTCTACCCTCCCTCCTGATGTC 
CCTGGAACAGGAACTCGCCTGACCCTGCTGCCACCTCCTGTGCACTTTGAGCAATGCCCC 
[C,T] 

TGGGATCACCCCAGCCACAAGCCCTTCGAGGGCCCTATACCATGGCCCACCTTGGAGCAG 
AGAGCCAAGCATCTTCCCTGGGAAGTCTTTCTGGCCAAGTCTGGCCAGCCTGGCCCTGCA 
GGTCTCCCATGAAGGCCACCCCATGGTCTGATGGGCATGAAGCATCTCAGACTCCTTGGC 
AAAAAACGGAGTCCGCAGGCCGCAGGTGTTGTGAAGACCACTCGTTCTGTGGTTGGGGTC 
CTGCAAGAAGGCCTCCTCAGCCCGGGGGCTATGGCCCTGACCCCAGCTCTCCACTCTGCT 

22334 CCCTGAGCTCCCTCATCCCAGC AGCCTCGC ACACTCCCTAGGCTTCTACCCTCCCTCCTG 

ATGTCCCTGGAACAGGAACTCGCCTGACCCTGCTGCCACCTCCTGTGCACTTTGAGCAAT 
GCCCCCTGGGATCACCCCAGCCACAAGCCCTTCGAGGGCCCTATACCATGGCCCACCTTG 
GAGCAGAGAGCCAAGCATCTTCCCTGGGAAGTCTTTCTGGCCAAGTCTGGCCAGCCTGGC 
CCTGCAGGTCTCCCATGAAGGCCACCCCATGGTCTGATGGGCATGAAGCATCTCAGACTC 
[C,A] 

TTGGCAAAAAACGGAGTCCGCAGGCCGCAGGTGTTGTGAAGACCACTCGTTCTGTGGTTG 
GGGTCCTGCAAGAAGGCCTCCTCAGCCCGGGGGCTATGGCCCTGACCCCAGCTCTCCACT 
CTGCTGTTAGAGTGGCAGCTCCGAGCTGGTTGTGGCACAGTAGCTGGGGAGACCTCAGCA 
GGGCTGCTCAGTGCCTGCCTCTGACAAAATTAAAGCATTGATGGCCTGTGGACCTGCTAC 
AGTGGCCTGGTGCCTCATACTCCTCAGGTGCAGGGGCAGGGACAAGAGAAGGGGGAAGTA 

2234 6 TCATCCCAGCAGCCTCGCACACTCCCTAGGCTTCTACCCTCCCTCCTGATGTCCCTGGAA 
CAGGAACTCGCCTGACCCTGCTGCCACCTCCTGTGCACTTTGAGCAATGCCCCCTGGGAT 
CACCCCAGCCACAAGCCCTTCGAGGGCCCTATACCATGGCCCACCTTGGAGCAGAGAGCC 
AAGCATCTTCCCTGGGAAGTCTTTCTGGCCAAGTCTGGCCAGCCTGGCCCTGCAGGTCTC 
CCATGAAGGCCACCCCATGGTCTGATGGGCATGAAGCATCTCAGACTCCTTGGCAAAAAA 
[C,-] 

GGAGTCCGCAGGCCGCAGGTGTTGTGAAGACCACTCGTTCTGTGGTTGGGGTCCTGCAAG 
AAGGCCTCCTCAGCCCGGGGGCTATGGCCCTGACCCCAGCTCTCCACTCTGCTGTTAGAG 
TGGCAGCTCCGAGCTGGTTGTGGCACAGTAGCTGGGGAGACCTCAGCAGGGCTGCTCAGT 
GCCTGCCTCTGACAAAATTAAAGCATTGATGGCCTGTGGACCTGCTACAGTGGCCTGGTG 
CCTCATACTCCTCAGGTGCAGGGGCAGGGACAAGAGAAGGGGGAAGTAACCCCATCAGGG 

2247 6 AC AAGCCCTTCGAGGGCCCTATACCATGGCCCACCTTGGAGCAGAGAGCCAAGCATCTTC 

CCTGGGAAGTCTTTCTGGCCAAGTCTGGCCAGCCTGGCCCTGCAGGTCTCCCATGAAGGC 
CACCCCATGGTCTGATGGGCATGAAGCATCTCAGACTCCTTGGCAAAAAACGGAGTCCGC 
AGGCCGCAGGTGTTGTGAAGACCACTCGTTCTGTGGTTGGGGTCCTGCAAGAAGGCCTCC 
TCAGCCCGGGGGCTATGGCCCTGACCCCAGCTCTCCACTCTGCTGTTAGAGTGGCAGCTC 
[C,T] 

GAGCTGGTTGTGGCACAGTAGCTGGGGAGACCTCAGCAGGGCTGCTCAGTGCCTGCCTCT 
GACAAAATTAAAGCATTGATGGCCTGTGGACCTGCTACAGTGGCCTGGTGCCTCATACTC 
CTCAGGTGCAGGGGCAGGGACAAGAGAAGGGGGAAGTAACCCCATCAGGGAGGAGTGGAG 
GGTGCCTGAGCCGCCATGTGGGCATTGGGGGAGTGATGGGAATGCCAGCAGTGATGACGT 
TGACTACTGACTGAGCACCCACTACTATGACTGAGCACTCACTCGCTAGATACTATCTTG 

23143 GCCGGGCGTGGTGGCTCACGCCTGTAATCCCAGCACTTTAGGAGGCCAAGGCAGGTGGAT 
CACAAGGTCAGGAGTTTGAGATCAGCCTGGCCAACATGGTGAAACTCCATCTTTACTAAA 
AATACAAAAAATTAGCCAGGCATGGTGTTGCATGCCTGCATGCCTGTAATCCCAGTTACT 
TGGGAAGCTGAGGCAGGAGAATTGCTTGAACCCTGGAGGCGGAGGTTGTAGTGAGCCGAG 
ATCACGCCATTGCACTCCAGCTTGGGCAAGAAGAGAAACACTCTCAAAAAAAAAAAAAAA 
[A,-,T] 

CAGGAAACTGGTGCTCAAAAAGGAAAAGTGACTCACCAAGGTCACAGACTAGGCAGTGAT 
GCTGGGGGAACCTGGCTCAGGGGACACAGACCTGGCCTGGGGCAGCCTTGCAGCTCCTCC 
ACTAAAATACTGAAAATGAGGGGCTTCGATGATGGTTATAATCGTATGGCAGAGCCCCAA 
CTCAACTGGAGCCCTGGGACCCAGAAGCTAGGGTCTCACTCCCTGCTTTTCCACAAGGCA 
CCATTAGGGCATCACCCCAGGCCTCGGCAGCCACGACGCAGGGATCCTGCCTCTCATTGG 

23445 AGGAAACTGGTGCTCAAAAAGGAAAAGTGACTCACCAAGGTCACAGACTAGGCAGTGATG 
CTGGGGGAACCTGGCTCAGGGGACACAGACCTGGCCTGGGGCAGCCTTGCAGCTCCTCCA 
CTAAAATACTGAAAATGAGGGGCTTCGATGATGGTTATAATCGTATGGCAGAGCCCCAAC 
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TCAACTGGAGCCCTGGGACCCAGAAGCTAGGGTCTCACTCCCTGCTTTTCCACAAGGCAC 
CATTAGGGCATCACCCCAGGCCTCGGCAGCCACGACGCAGGGATCCTGCCTCTCATTGGT 
[T,CJ 

GGGGGCTTAGGGGCTCTGGGCTGCCCTCTTGAAGAGGGGGTTCAGCCCAGCGAGGCACCC 
CCTATGCTGCACCCCACCAAGGTTAGGAAGAGGTCCTGTCCTCAGTGGGGCCCTCTGATG 
AACAGCCCATCAGGTCTGCGTCCACATGCCTTGGAAGAGATGGTGACATACTCAAAGTCC 
TTGAAGCCGCATATTAAACCACCTAGAGCACCATCTTCAAACATTTAGGGTCTGAGAAGA 
TAGGGGAAGTAAGCAATTTAAAACATTTCTTTATATTGGGCCAGGTGCAATGGCTCACGT 

23974 GGTCTGAGAAGATAGGGGAAGTAAGCAATTTAAAACATTTCTTTATATTGGGCCAGGTGC 
AATGGCTCACGTCTGTAATCCCAGCGCTTTGGGAGGACGAGGATCACCTGAGGTCAGGAG 
TTCAAGATCAGCCTGGCCAACATGGAGAAACCCCATCTCTACTAAAAATACAAAAATTAG 
CTCAGGCGTGGTGATGTGCACCTGTAATCCTAGCTATTCAGGAGGCTGAGGCACAAGAAT 
TGCTTGAGTCAATATTGCACCACTGCACTCCAGCCTGGGCAACAGCGAGACTCTTGTCTC 
[A,-] 

AAAAAAAAAAAAGATATTTGCTGAAAAGACCCAGCCTGCCAAACTCAGGGGCAGCCAAGG 
GAGGTAGTGAAATGGAAGTTGGAGCTCAGCGCTCCCACACCTCCACTGCCCTCAGGCCTT 
CTCTGCCTCTTTCCCATCAGTCAGCTGCTTCTGGGCATGGTCCTGGCAGAGACTTGGCCT 
CCTTCCAGTTCAAGCTCCCTCTTAGATTGTGTCCCACGCCACTGAGTCTTTGGGACACTG 
GGTCAGATGTCTAGTCTGGCACAATTGGCAGGAATCCCAAGAAACAGTGTGAGTGAGGGG 

24865 ATAGCTGGGATTACAGGTGTGCACCACCATGCCCAGCCTAATTTTTGTATTATTAGTAGA 
GATGGGGTTTCACCATGTTGTCCAGGCTGGTCATGAACTCCTGACCTCAAGTGATCCACC 
CGCTTTGGCCTCCCAAAGTGCTGGGATTACAAGCATGAGCCACAGTGCCTGGCCTGACCC 
TGCTCTTTTGAAAGACCATTCCCCCAAATTCTGTGCACCTGTGTGCCTTTCTTCTCTCTG 
CCTCCTCTCAGCTCTGCCCCGCTCTCCTCCCTTCTCCTCTGGCAAATCCCACTCATCTCT 
CG,TJ 

GAAGCCCTTCTTCCAGGGGAAGCCCTGATCATGCTGCTTTCTCCTGTGGGAGGGATGAAG 
GACGTGGCCCACGGAGTTTGTTTTGTTTTGTTTTGAGATGGAGTTTTGCTCATGTTGCCC 
AGGCTGGGGTACAATGGTACGATCTCAGCTCACTGCAACCTCTACGTCCCGGGTTCAAGC 
GGTTCTCCTGCCTTAGCCTCCCCAGTAGCTGGGATTACTGGCATGAACCACCACACCTGG 
CTAATTTTGTGTTTTTAGTAGAGATGGGGTTTCTTCATGTTGGTCAGGCTGGTCTCGAAC 

25040 GACCCTGCTCTTTTGAAAGACCATTCCCCCAAATTCTGTGCACCTGTGTGCCTTTCTTCT 
CTCTGCCTCCTCTCAGCTCTGCCCCGCTCTCCTCCCTTCTCCTCTGGCAAATCCCACTCA 
TCTCTTGAAGCCCTTCTTCCAGGGGAAGCCCTGATCATGCTGCTTTCTCCTGTGGGAGGG 
ATGAAGGACGTGGCCCACGGAGTTTGTTTTGTTTTGTTTTGAGATGGAGTTTTGCTCATG 
TTGCCCAGGCTGGGGTACAATGGTACGATCTCAGCTCACTGCAACCTCTACGTCCCGGGT 
[T,C] 

CAAGCGGTTCTCCTGCCTTAGCCTCCCCAGTAGCTGGGATTACTGGCATGAACCACCACA 
CCTGGCTAATTTTGTGTTTTTAGTAGAGATGGGGTTTCTTCATGTTGGTCAGGCTGGTCT 
CGAACTCCCAACCTCAGGTGATCTGCCTGCCTCGGCCTCCCAAAGTACTGGGATTACAGG 
GTTGAGCCACTGTGCCTGGCCCAGGCCCACGGAGTTTTAAGAGGCTTCCTGTGGCAGTGG 
CATCCAGACGGAGTGCAGAAACTCAAAGTTGAAGGCCAGAAGCTCAGGGAAGGGGGAGTG 

2 504 6 GCTCTTTTGAAAGACCATTCCCCCAAATTCTGTGCACCTGTGTGCCTTTCTTCTCTCTGC 
CTCCTCTCAGCTCTGCCCCGCTCTCCTCCCTTCTCCTCTGGCAAATCCCACTCATCTCTT 
GAAGCCCTTCTTCCAGGGGAAGCCCTGATCATGCTGCTTTCTCCTGTGGGAGGGATGAAG 
GACGTGGCCCACGGAGTTTGTTTTGTTTTGTTTTGAGATGGAGTTTTGCTCATGTTGCCC 
AGGCTGGGGTACAATGGTACGATCTCAGCTCACTGCAACCTCTACGTCCCGGGTTCAAGC 
[G,A] 

GTTCTCCTGCCTTAGCCTCCCCAGTAGCTGGGATTACTGGCATGAACCACCACACCTGGC 
TAATTTTGTGTTTTTAGTAGAGATGGGGTTTCTTCATGTTGGTCAGGCTGGTCTCGAACT 
CCCAACCTCAGGTGATCTGCCTGCCTCGGCCTCCCAAAGTACTGGGATTACAGGGTTGAG 
CCACTGTGCCTGGCCCAGGCCCACGGAGTTTTAAGAGGCTTCCTGTGGCAGTGGCATCCA 
GACGGAGTGCAGAAACTCAAAGTTGAAGGCCAGAAGCTCAGGGAAGGGGGAGTGTGAGTT 

25193 ATCATGCTGCTTTCTCCTGTGGGAGGGATGAAGGACGTGGCCCACGGAGTTTGTTTTGTT 
TTGTTTTGAGATGGAGTTTTGCTCATGTTGCCCAGGCTGGGGTACAATGGTACGATCTCA 
GCTCACTGCAACCTCTACGTCCCGGGTTCAAGCGGTTCTCCTGCCTTAGCCTCCCCAGTA 
GCTGGGATTACTGGCATGAACCACCACACCTGGCTAATTTTGTGTTTTTAGTAGAGATGG 
GGTTTCTTCATGTTGGTCAGGCTGGTCTCGAACTCCCAACCTCAGGTGATCTGCCTGCCT 
[C,T] 

GGCCTCCCAAAGTACTGGGATTACAGGGTTGAGCCACTGTGCCTGGCCCAGGCCCACGGA 
GTTTTAAGAGGCTTCCTGTGGCAGTGGCATCCAGACGGAGTGCAGAAACTCAAAGTTGAA 
GGCCAGAAGCTCAGGGAAGGGGGAGTGTGAGTTGAGGAGTCTCTTGGCTGCCAGGGCCAG 
AAACCGAACTCCAAGCCTCTCCACAACAGCGGGTGTAGAGCATGTAGAATCAGAGAGGAG 
G C T G AGC CATGCAGCCCC GAG AAG AGGGG AAT G C C AC T GAG C C A C AG AG AC C C AG T G C C A 

25529 AGTGCAGAAACTCAAAGTTGAAGGCCAGAAGCTCAGGGAAGGGGGAGTGTGAGTTGAGGA 
GTCTCTTGGCTGCCAGGGCCAGAAACCGAACTCCAAGCCTCTCCACAACAGCGGGTGTAG 
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AGCATGTAGAATCAGAGAGGAGGCTGAGCCATGCAGCCCCGAGAAGAGGGGAATGCCACT 

GAGCCACAGAGACCCAGTGCCACTGCCAGGTGTCTCTGCCTCCACTTCCCATGACCC 

ET # G] 

GCCTGTCTCTGTATGCAGGCTTCACCCTCTCTCGTTGTACATTGTACACATTCTAGGTGA 
CACCAGCAGCTTCTGATTCTCATCTCCCATAACATCAGCCCCCCAGAGAGGGGACAACTG 
CTGAGCTGATAACATAATAGATGCCCCTTTCCTGGAGGCCATGGTCATGGTCAGCGTGGA 
GAGGATGAAGCCTGAGCAGGCAGGATCGGGGGTCTAGAGGGGAAGGAGGTGGAAGTT 

25614 GGCCAGAAGCTCAGGGAAGGGGGAGTGTGAGTTGAGGAGTCTCTTGGCTGCC AGGGCC AG 

AAACCGAACTCCAAGCCTCTCCACAACAGCGGGTGTAGAGCATGTAGAATCAGAGAGGAG 
GCTGAGCCATGCAGCCCCGAGAAGAGGGGAATGCCACTGAGCCACAGAGACCCAGTGCCA 
CTGCCAGGTGTCTCTGCCTCCACTTCCCATGACCCGGCCTGTCTCTGTATGCAGGCTTCA 
CCCTCTCTCGTTGTACATTGTACACATTCTAGGTGACACCAGCAGCTTCTGATTCTCATC 
[C,T] 

C C C AT A AC AT C AGC CC C C C AG AG AG GGG AC AAC T G C T G AGC T GAT AAC AT AAT AG AT GC C 
CCTTTCCTGGAGGCCATGGTCATGGTCAGCGTGGAGAGGATGAAGCCTGAGCAGGCAGGA 
TCGGGGGTCTAGAGGGGAAGGAGGTGGAAGTTGAGATCACAGACCTGTGGTCAGGTGGCC 
TGGGAAGGGTTTGACGAGTGTCGGCCCAAAGAGCTTGGAAGGGATTTTGCTGCTGTGGGT 
GAGCACTGCCTCTCCCCTTAGGGACAACAGCCACCTCTTCTCTCCCCATTTGCCTTTCCC 

2 5785 CCAGTGCCACTGCCAGGTGTCTCTGCCTCCACTTCCCATGACCCGGCCTGTCTCTGTATG 
CAGGCTTCACCCTCTCTCGTTGTACATTGTACACATTCTAGGTGACACCAGCAGCTTCTG 
ATTCTCATCTCCCATAACATCAGCCCCCCAGAGAGGGGACAACTGCTGAGCTGATAACAT 
AATAGATGCCCCTTTCCTGGAGGCCATGGTCATGGTCAGCGTGGAGAGGATGAAGCCTGA 
GCAGGCAGGATCGGGGGTCTAGAGGGGAAGGAGGTGGAAGTTGAGATCACAGACCTGTGG 
[C,T] 

CAGGTGGCCTGGGAAGGGTTTGACGAGTGTCGGCCCAAAGAGCTTGGAAGGGATTTTGCT 
GCTGTGGGTGAGCACTGCCTCTCCCCTTAGGGACAACAGCCACCTCTTCTCTCCCCATTT 
GCCTTTCCCTTCTGTAGATATGAAACACAGGCCTCCTTGTCAGGCCCCTACTTAACCTCC 
GTGATGGGGAAAGCGGCCGGAGAAAGGGAGTTCTGGCAGCTTCTCCGAGACCCCAACACC 
CCACTGTTGCAAGGTGAGTCATGGCCTGACACTCTGGATGTGTCCCCTACCCCAAGCTTA 

26266 GTGATGGGGAAAGCGGCCGGAGAAAGGGAGTTCTGGCAGCTTCTCCGAGACCCCAACACC 
CCACTGTTGCAAGGTGAGTCATGGCCTGACACTCTGGATGTGTCCCCTACCCCAAGCTTA 
CTCAGCCAAGAGGCTTCATCAACTCACCCCAGCTTTCCCTAGCACCCTCCTGGGCCACAC 
CTTCACAAAATCACTGATGCTCAAAGTTGGATATAATATATTGAACTGAAGCCTTAGCAT 
T T T T AT GC AAGT T AC T G T GG AA AT T C T AG G A A AC C AG AC A GAT T AC A AAA AAA AA AAA A A 
[A,-] 

CTAGAAGAAAATTAACATCACCTAGGATATACTACCTAGGAATAACGTCTTTTATTTTGA 
GATGGAGTTTCGCTCTTGTTGCCCAGGCTGGAGTGCAGCGGTATGATCTCGGCTCGCTGC 
AACCTCCGCCTCCTGGGTTCATGTGATTCTTCCACCTCGGCCTTCCTAGAGCCCAAGTGG 
TCTGCCTGCCTCTGCCTCCCAAAGTTCTGGGATTACAGGCATGAGCCACCGCACCCAGCC 
AAAATTACTTAACTTTTCTTCTAGATACTTTTTAAAAATATGGCAGTAAGTTTTTCATAA 
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